VOLUME EDITOR: H. WALDMANN

Bioorganic Chemistry
of Biological
Signal Transduction

Springer

Topics in Current Chemistry

211

Topics in Current Chemistry 



Editorial Board:

A. de Meijere • K.N. Houk • H. Kessler
J.-M. Lehn • S.V. Ley • S.L. Schreiber • J. Thiem
B.M. Trost • F. Vögtle • H. Yamamoto




Springer 

Berlin 

Heidelberg 

New York 

Barcelona 

Hong Kong 

London 

Milan 

Paris 

Singapore 

Tokyo 




Bioorganic Chemistry of 
Biological Signal Transduction 

Volume Editor: Herbert Waldmann 



With contributions by 

G. Dormán, P. J. Hergenrother, C. Herrmann,
J. Kuhlmann, A. Levitzki, S. F. Martin, G. Müller,
M. Thutewohl, H. Waldmann




Springer 




The series Topics in Current Chemistry presents critical reviews of the present and 
future trends in modern chemical research. The scope of coverage includes all areas 
of chemical science including the interfaces with related disciplines such as biology, 
medicine and materials science. The goal of each thematic volume is to give the
non-specialist reader, whether at the university or in industry, a comprehensive overview
of an area where new insights are emerging that are of interest to a larger scientific 
audience. 

As a rule, contributions are specially commissioned. The editors and publishers will, 
however, always be pleased to receive suggestions and supplementary information. 
Papers are accepted for Topics in Current Chemistry in English. 

In references Topics in Current Chemistry is abbreviated Top. Curr. Chem. and is 
cited as a journal. 



Springer WWW home page: http://www.springer.de
Visit the TCC home page at http://link.springer.de/series/tcc/
or http://link.springer-ny.com/series/tcc/



ISSN 0340-1022 
ISBN 3-540-67746-1 

Springer-Verlag Berlin Heidelberg New York



Library of Congress Catalog Card Number 74-644622 

This work is subject to copyright. All rights are reserved, whether the whole or part 
of the material is concerned, specifically the rights of translation, reprinting, reuse of
illustrations, recitation, broadcasting, reproduction on microfilm or in any other ways,
and storage in data banks. Duplication of this publication or parts thereof is only 
permitted under the provisions of the German Copyright Law of September 9, 1965, 
in its current version, and permission for use must always be obtained from Springer- 
Verlag. Violations are liable for prosecution under the German Copyright Law. 

Springer-Verlag Berlin Heidelberg New York
a member of BertelsmannSpringer Science+Business Media GmbH
© Springer-Verlag Berlin Heidelberg 2001
Printed in Germany 

The use of general descriptive names, registered names, trademarks, etc. in this 
publication does not imply, even in the absence of a specific statement, that such 
names are exempt from the relevant protective laws and regulations and therefore 
free for general use. 

Cover design: Friedhelm Steinen-Broo, Barcelona; MEDIO, Berlin 
Typesetting: Fotosatz-Service Köhler GmbH, 97084 Würzburg

SPIN: 10649296 02/3020ra - 543210 - Printed on acid-free paper 




Volume Editor 



Prof. Dr. Herbert Waldmann
Max-Planck-Institut für Molekulare Physiologie
Universität Dortmund
Otto-Hahn-Straße 11
44221 Dortmund, Germany
E-mail: herbert.waldmann@mpi-dortmund.mpg.de



Editorial Board 

Prof. Dr. Armin de Meijere
Institut für Organische Chemie
der Georg-August-Universität
Tammannstraße 2
37077 Göttingen, Germany
E-mail: ameijer1@uni-goettingen.de

Prof. Dr. Horst Kessler
Institut für Organische Chemie
TU München
Lichtenbergstraße 4
85747 Garching, Germany
E-mail: kessler@ch.tum.de

Prof. Steven V. Ley
University Chemical Laboratory
Lensfield Road
Cambridge CB2 1EW, Great Britain
E-mail: svl1000@cus.cam.ac.uk

Prof. Dr. Joachim Thiem
Institut für Organische Chemie
Universität Hamburg
Martin-Luther-King-Platz 6
20146 Hamburg, Germany
E-mail: thiem@chemie.uni-hamburg.de

Prof. Dr. Fritz Vögtle
Kekulé-Institut für Organische Chemie
und Biochemie der Universität Bonn
Gerhard-Domagk-Straße 1
53121 Bonn, Germany
E-mail: voegtle@uni-bonn.de

Prof. Kendall N. Houk
Department of Chemistry and Biochemistry
University of California
405 Hilgard Avenue
Los Angeles, CA 90024-1589, USA
E-mail: houk@chem.ucla.edu

Prof. Jean-Marie Lehn
Institut de Chimie
Université de Strasbourg
1 rue Blaise Pascal, B.P.Z 296/R8
67008 Strasbourg Cedex, France
E-mail: lehn@chimie.u-strasbg.fr

Prof. Stuart L. Schreiber
Chemical Laboratories
Harvard University
12 Oxford Street
Cambridge, MA 02138-2902, USA
E-mail: sls@slsiris.harvard.edu

Prof. Barry M. Trost
Department of Chemistry
Stanford University
Stanford, CA 94305-5080, USA
E-mail: bmtrost@leland.stanford.edu

Prof. Hisashi Yamamoto
School of Engineering
Nagoya University
Chikusa, Nagoya 464-01, Japan
E-mail: j45988a@nucc.cc.nagoya-u.ac.jp




Topics in Current Chemistry 
Now Also Available Electronically 



For all customers with a standing order for Topics in Current Chemistry we offer 
the electronic form via LINK free of charge. Please contact your librarian who can 
receive a password for free access to the full articles by registration at:

http://link.springer.de/series/tcc/reg_form.htm 

If you do not have a standing order you can nevertheless browse through the 
table of contents of the volumes and the abstracts of each article at: 

http://link.springer.de/series/tcc

There you will also find information about the 

- Editorial Board 

- Aims and Scope 

- Instructions for Authors 




Preface 



The transduction of signals from the extracellular space across the plasma
membrane into the interior of cells and ultimately to the nucleus, where in
response to such external signals the transcription of the genetic code is
influenced, is among the most fundamental and important events in the regulation
of the life cycle of cells. During recent years several signal transduction cascades
have been elucidated which regulate, for instance, the growth and the prolifera- 
tion of organisms as diverse as mammals, flies, worms and yeast. The general 
picture which emerged from these investigations is that nature employs a com- 
bination of non-covalent ligand/protein and protein/protein interactions 
together with a set of covalent protein modifications to generate the signals and 
transduce them to their destinations. The ligands which are recognized may be 
low molecular weight compounds like lipids, inositol derivatives, steroids or 
microbial products like cyclosporin. They may be proteins like, for instance, 
growth factors or intracellular adaptor proteins which carry SH2 or SH3 
domains, and they may be specific DNA stretches which are selectively recog- 
nized by transcription factors. 

These and other aspects of biological signal transduction provide an open 
and rewarding field for investigations by scientists from various different disci- 
plines of biology, medical research and chemistry working in academic research 
institutions or in industry. In particular, it is full of opportunities for bioorganic 
studies in which the expertise of biologists and chemists has to be merged,
and it is highly relevant to medicinal chemistry since “signal transduction thera- 
py” is a new promising approach for the treatment of various diseases including 
cancer. 

This book presents a selection of the most advanced topics from the bio- 
organic and medicinal chemistry of signal transduction. The authors who have 
contributed include biologists, biochemists, medicinal chemists and synthetic 
organic chemists and work in universities and in industry. The topics which 
are covered address various important events involved in the transduction of 
chemical signals in biological systems as summarized above. It was my intention 
to bring together scientists from several subdisciplines and to highlight the pro- 
gress that has been achieved in this multidisciplinary field. In reviewing the 
papers which now finally appear in print I am confident that I have succeeded in 
my attempt. 



Dortmund, August 2000 



Herbert Waldmann

Protein tyrosine kinases (PTKs) play a key role in cell signaling and regulate biological
processes such as proliferation, differentiation, and apoptosis. The malfunctioning of these
proteins is the root of many diseases. Over 60% of all oncoproteins and proto-oncoproteins,
which play a key role in cancers, are PTKs. Malfunctioning of PTKs is also the hallmark of
other diseases such as psoriasis, papilloma, atherosclerosis, pulmonary fibrosis and more.
It is therefore logical to target these proteins for drug design aiming at selective and non- 
toxic drugs. Since the second half of the 1980s we have pioneered and continued to generate 
tyrosine phosphorylation inhibitors (tyrphostins) as agents against diseases such as 
cancers, leukemias, lymphomas, psoriasis, restenosis, angiogenesis, and more. Currently, 
there are already a few tyrphostins in clinical trials. We argue that it is likely that the most 
effective tyrphostins in the future will be those which target the substrate binding domain 
of the PTK and not the ATP sub-domain. It is highly likely that tyrphostins against the key 
PTKs that play a pivotal role in diseases will become an important component in human 
therapy. 

Keywords: Tyrphostins, Protein tyrosine kinases, JAK-2, EGF, PDGF, BCR-ABL, Psoriasis,
Papilloma, Restenosis, Leukemia, Lymphoma

1

Introduction 

Discoveries in the area of signal transduction pathways over the past two 
decades have revealed that many diseases result from the malfunctioning of 
these pathways. In fact, nearly every disease can be defined in molecular terms 
and can be characterized as a “fingerprint” of a set of biochemical lesions. As a 
consequence of this revolution in understanding disease an entirely new field 
has evolved known as “signal transduction therapy” [1]. Both small molecules 
and proteins can now be used as correcting agents for the malfunction of signal 
transduction pathways thus taking on the function of a drug. This review will 
concentrate on protein tyrosine kinases (PTKs). 

The malfunction of protein tyrosine kinases (PTKs) is the hallmark of
numerous diseases. Not only do PTKs account for over 60% of oncogenes and
proto-oncogenes involved in human cancers, but enhanced activity of PTKs has
also been implicated in such nonmalignant diseases as psoriasis, papilloma,
restenosis and pulmonary fibrosis. It is therefore not surprising that there has
been such a surge of studies attempting to target PTKs for drug development
over the past decade (for review see [2]). For many diseases the understanding
of molecular pathology has advanced far enough to enable identification of
molecular aberrations. Among such malfunctioning signaling elements PTKs appear
prominently. Table 1 summarizes those PTK signaling molecules whose altered
activities have proven to be directly correlated with a human disease. Other
PTKs are also implicated, but the evidence is more correlative than in the
cases summarized in Table 1.

2 

Universal Targets and Selective Targets 

In most cases of proliferative disease more than one signaling pathway is involv- 
ed; this is especially true in cancers where many genetic alterations have taken 
place on the cell’s pathway to its transformed state. Even in such benign condi- 
tions as psoriasis or restenosis several signaling pathways have been implicated. 
In all of the above cases more than one PTK is likely to be involved though 
in many instances it is possible to identify one PTK that plays a key role in 
the disease. Table 1 lists most of these PTKs and the diseases in which they are 
involved. These PTKs are candidates for drug targeting. PTKs such as IGF-1R
and the Src family kinases have been implicated in many types of cancer thus 
qualifying as universal targets [2, 3] for the development of PTK inhibitors. 

Table 1. PTKs whose enhanced activity is correlated with specific diseases

PTK involved            Type of altered activity          Disease implicated

EGFR, unmutated         Overexpression and autocrine      Epithelial cancers, psoriasis,
                        stimulation                       papilloma

EGFR, truncated         Constitutive activation           Gliomas, breast and ovary
                                                          cancers

PDGFR                   Overexpression and autocrine      Glioblastomas, restenosis,
                        stimulation. Activation of        atherosclerosis
                        blood vessel smooth muscle
                        cells in the media

TEL-PDGFR               Constitutive activation of        Chronic myelomonocytic
fusion protein          PDGF receptor                     leukemia

Bcr-Abl                 Constitutive activation in the    Chronic myeloid leukemia
                        cytoplasm of blood stem cells     (CML)

Jak-2                   Persistent activation             Recurrent pre-B acute
                                                          lymphoblastic leukemia
                                                          (pre-B ALL); IL-6 dependent
                                                          multiple myeloma

Her-2/neu, Her-3,       Overexpression                    Breast, ovary, lung, gastric
Her-4 heterodimers                                        cancers

VEGFR/Flk-1/KDR         Activation of tumor               All cancers
                        vascularization by the
                        anoxic tumor

c-Src, c-Yes            Persistently activated            Cancers of the lung,
                                                          colon, breast, prostate

IGF1-R                  Overexpression, autocrine         Cancers, psoriasis
                        stimulation

Out of those PTKs involved in pathophysiological states a relatively small
number show up in cancers and other proliferative states. For example,
overexpression of the EGF receptor kinase is the hallmark of most, if not all,
epithelial cancers. This is usually accompanied by the autocrine or paracrine
expression of its ligands, producing persistent enhanced stimulation of EGF 
dependent pathways. In certain tumors, a truncated persistently active version 
of the receptor is overexpressed and induces intense signaling. It is therefore 
little wonder that attempts to generate EGFR kinase directed tyrphostins have 
been pursued since the search for PTK inhibitors began. The development of an 
inhibitor for the EGFR kinase which could possibly become a universal inhibitor 
should be regarded as an important objective. 

Enhanced EGFR signaling is also the hallmark of papilloma induced by
HPV 16/18 [4] and of psoriasis [5]. Blockers of the EGFR kinase were therefore
suggested as potent anti-psoriasis agents [5-7] and anti-papilloma agents [4].
Indeed AG 1517 (SU 5271 = PD 153035, see below) has
been undergoing clinical trials since early 1997 [7]. Similarly, the involvement of
HER-2/neu in breast, ovarian, lung and gastric cancers makes this close relative 
of the EGFR kinase an attractive target for drug design. Overexpression of 
HER-2 occurs in a significant proportion (10-35%) of the cases. In some 
instances it is possible to identify one particular PTK whose activity is correlated
with the disease. For example, in the chronic phase of chronic myeloid leukemia
(CML) the fusion protein Bcr-Abl, the product of the Philadelphia
chromosome, is implicated as the cause of the leukemia [8]. TEL-PDGFR [9] and
Jak-2 [10] are similarly associated with other forms of leukemia. In these cases
highly selective inhibitors can be used to inhibit and even selectively purge the
diseased cells. For most tumors, however, one signal transduction inhibitor may
not in itself be sufficient to eradicate the disease. Indeed, early in vivo
experiments show that tyrphostin RG 13022, an EGFR kinase blocker, is capable of
inhibiting the progress of tumor growth and prolonging survival of nude mice
implanted with a human squamous tumor which overexpresses EGFR, but treatment
with RG 13022 is not by itself sufficient to cause total eradication of the
disease [11]. It is, therefore, probable that PTK inhibitors will prove most
efficient in combination with other drugs such as cytotoxic drugs ([12, 13] and 
discussion below) or antibodies [11]. 

3 

Design and Synthesis of Protein Tyrosine Phosphorylation Inhibitors 

Once it became apparent that the enhanced activity of PTKs was a major
contributor to oncogenesis, a search for tyrosine kinase inhibitors began. The
systematic synthesis of selective PTK inhibitors (tyrosine phosphorylation
inhibitors = tyrphostins) that show selectivity towards the isolated EGFR kinase and
do not inhibit Ser/Thr kinases was reported in the late 1980s [14]. It took a few
more years to produce highly potent and selective tyrosine kinase inhibitors,
mostly by semi-rational drug design and high throughput screening (for review
see [2, 3, 15, 16]). Kinetic analysis of the mode of action of the EGFR kinase and of
pp60c-Src shows that ATP and the substrate bind independently to the kinase
domain and no sequential binding occurs [17]. This property simplifies the
kinetic analysis of the inhibitors' mode of inhibition of PTKs [18].

One of the most surprising findings on the selectivity of inhibitors discovered
so far is the extent to which ATP-competitive inhibitors can be selective. For
example, quinoxalines are highly selective inhibitors of the PDGFR kinase [19, 20]
and quinazolines of the EGFR kinase ([21] and see below). Further analysis of
the mode of tyrosine kinase inhibition reveals that the affinity of the inhibitor
and its mode of binding to the kinase domain depend on whether the kinase is
in its activated form or basal inactive state. Two examples illustrate this point.
Activated Abl kinases like p210Bcr-Abl and p185Bcr-Abl possess different affinities
for both substrate and inhibitors (tyrphostins) as compared to the
proto-oncogenic form p140c-Abl [8]. For example, p210Bcr-Abl and p185Bcr-Abl are
inhibited by the tyrphostin AG 957 with Ki values of 0.75 µmol/l and 1.5 µmol/l,
as compared to Ki = 10.0 µmol/l for the cellular wild type p140c-Abl. In all these
cases AG 957 is competitive with the substrate and non-competitive with ATP. The
difference between c-Abl and Bcr-Abl is not found in the kinase domain, since the Bcr
sequence is fused in-frame with the intact kinase domain. It would therefore
appear that tethering the Bcr sequence upstream of c-Abl alters the
conformation of the kinase domain such that it binds the inhibitor more tightly. The
same pattern of behavior is observed vis-a-vis the substrates: the oncogenic
forms exhibit lower Km values towards the substrates as compared to the
proto-oncogenic forms. A similar relationship was found for the mouse proteins, where
v-Abl (= gag-Abl) exhibited higher affinity towards the substrates and inhibitors
as compared to the proto-oncogenic form of the protein [8]. In the case of the
PDGFR kinase, activation of the receptor also leads to changes in the structure
of the kinase domain, but the situation is slightly more complex: upon activation
the mode of inhibition of the selective inhibitor AG 1296 (or AG 1295) is altered.
Whereas the inhibitor is competitive vis-a-vis ATP in the inactive form of the
receptor, it binds with higher affinity and becomes mixed competitive vis-a-vis
ATP subsequent to receptor activation by PDGF [22]. These two examples suggest
that this may be a general pattern of behavior for PTKs [23]. This finding
also points to the necessity of examining potential PTK inhibitors not only as
blockers of PTK autophosphorylation, which is done routinely, but also as
blockers of the PTK action on exogenous substrates.
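
The affinity difference described above can be expressed as a fold-selectivity ratio. The short Python sketch below is an illustration only: it uses the Ki values for AG 957 quoted in the text and assumes that 0.75 and 1.5 µmol/l refer to p210Bcr-Abl and p185Bcr-Abl in the order in which the proteins are named. The function name and the dictionary layout are hypothetical and are not taken from the cited work.

```python
# Ki values (in µmol/l) for AG 957 as quoted in the text; the assignment of
# 0.75 and 1.5 to p210 and p185 follows the order of mention (an assumption).
KI_UM = {
    "p210Bcr-Abl": 0.75,
    "p185Bcr-Abl": 1.5,
    "p140c-Abl": 10.0,
}


def fold_selectivity(ki_reference: float, ki_target: float) -> float:
    """Ratio of Ki values: how many times tighter the target is bound."""
    if ki_reference <= 0 or ki_target <= 0:
        raise ValueError("Ki values must be positive")
    return ki_reference / ki_target


if __name__ == "__main__":
    wild_type = KI_UM["p140c-Abl"]
    for form in ("p210Bcr-Abl", "p185Bcr-Abl"):
        ratio = fold_selectivity(wild_type, KI_UM[form])
        print(f"AG 957 binds {form} {ratio:.1f}-fold more tightly than p140c-Abl")
```

A ratio of this kind only compares binding constants measured under the same assay conditions; it says nothing about selectivity in the intact cell.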

With the advance of X-ray crystallography and the ability to determine the
three-dimensional structure of the kinase with the bound inhibitor, drug design
has become more precise and rational. The three-dimensional structures of a few
tyrosine kinases complexed with kinase inhibitors have already been solved:

1. The structure of the FGF receptor with selective and non-selective inhibitors
has been published [24, 25].

2. The Src family kinase Hck has been crystallized in its inactive form in complex
with the inhibitor PP1 [26]. Similarly, Lck in its active form has been
crystallized with the inhibitor PP2 [27].

These structures are currently guiding a number of laboratories in their attempt 
to design novel, more selective Src kinase inhibitors. 

This being the case, the next challenge is to attempt to design compounds
which selectively block pp60c-Src, Yes and Fyn, which are activated in many human
malignancies ([28, 29] and references therein), but do not significantly block
other Src family kinases that are involved in other pathways. Similarly,
the availability of the insulin receptor kinase structure in its inactive form [30]
as well as in its active form complexed with APPNHP and a peptide substrate
[31] has enabled an educated search for inhibitors of the IGF-1R kinase, which
is highly homologous to the insulin receptor kinase.

This search has resulted in identification of AG 538 as a potent inhibitor of
the IGF-1R kinase [32]. The AG 538 family was identified by this type of
"educated" search, which was designed to find inhibitors that mimic the protein
loop encompassing tyrosine 1158 and tyrosine 1162. Within the insulin receptor
this loop occupies the substrate site in the inactive state and becomes
phosphorylated upon activation. Indeed AG 538 is capable of inhibiting IGF1-R
autophosphorylation with an IC50 value of 0.4 µM [32]. This series of inhibitors
is different from AG 1024 and some of its analogs, which were found by random
screening [33]. It will be most interesting to compare the mode of binding of
AG 1024 with that of AG 538 within the active site of the receptor. It is
important to note that some of the inhibitors discriminate between the insulin
receptor kinase and the IGF1-R kinase by a factor of up to 8 [33]. The challenge
remains to design inhibitors of the IGF1-R kinase which differentiate between the
insulin receptor kinase and the IGF1-R kinase by a wider margin. Table 2
summarizes the main pharmacophores which have proven to be effective PTK
inhibitors with no significant effects on Ser/Thr kinases. Table 3 summarizes the
specific compounds which have shown biological efficacy on the targets quoted.

Table 2. Lead pharmacophores for PTK inhibitors

[Chemical structures not reproduced. Pharmacophore classes shown: benzene
malononitriles, pyrazolo pyrimidines, lavendustins, anilido phthalimides,
quinazolines, quinoxalines]

4 

ATP Mimics vis-a-vis Substrate Mimics 

It was initially argued that the best potential PTK inhibitors would be
compounds that compete with the substrate in the kinase binding domain. Such
compounds were expected to be less toxic than ATP mimics, since they
bind to those domains at the kinase site that are less conserved than the ATP
binding domains. Indeed tyrphostins like AG 490, which blocks Jak-2 [10],
and AG 556, which possesses anti-inflammatory properties, have been shown to
be highly non-toxic in vivo [34-37].

Table 3. Tyrphostins with biological activities



[Chemical structures not reproduced. Compound labels: AG 490, AG 556,
CGP 59326A; group label: EGFR inhibitors]

Over the past decade, however, the PTK inhibitors favored by most investigators
have been ATP mimics that compete with ATP at the binding site. Most of
the compounds depicted in Table 2 are ATP mimics with the exception of the 
tyrphostins developed by us. In the case of tyrphostins one can indeed classify 
compounds which compete with ATP, compounds which compete with the sub- 
strate and bisubstrate inhibitors which compete with the substrate and ATP 
simultaneously [18]. Compounds can also be identified that act as “mixed” com- 
petitive inhibitors which bind simultaneously with ATP and/or substrate but 
decrease the affinity of ATP and the substrate to their respective sites [18, 22]. 
Among the tyrphostins all classes of compounds can be identified [18] but the 
real question is which of these is preferable for clinical development. 

In recent clinical trials in which PTK inhibitors are being utilized, doses
of 50-100 mg/kg have been used. These high doses reflect the relatively low
efficacy of these compounds in vivo despite the fact that their IC50 values
towards their molecular targets, such as the EGFR and VEGFR-2/Flk-1, are in the
nanomolar range. The high doses required partly reflect the competitive
relationship between the intracellular millimolar concentrations of ATP and the
administered drug. It is noteworthy that drugs such as beta-adrenergic blockers
are administered at doses which are lower by about 100-fold. In this case
a drug possessing an affinity in the nanomolar concentration range has to
compete with up to 0.1 µmol/l of the endogenous ligand
adrenaline or nor-adrenaline. Thus β-blockers can be administered
at doses of 1.0 mg/kg and still be effective. It seems in light of this that the
possibility of developing substrate competitive tyrosine kinase inhibitors as
opposed to ATP mimics should be reconsidered. A further consideration is the
possibility of ATP mimics being fairly toxic, since the selectivity of newly
developed compounds is only tested against a limited number of PTKs and Ser/Thr
kinases, whereas the number of PTKs falls in the 400-500 range and the number
of Ser/Thr kinases is in the thousands. It has already been observed that
the so-called selective Src family kinase inhibitor PP1 is in fact equipotent as a
PDGFR kinase inhibitor [38]. Similarly the Novartis Bcr-Abl kinase inhibitor
CGP 57148 [39], currently undergoing clinical trials for chronic myeloid leukemia
(CML), is as potent against the PDGFR kinase [40]. It is likely that this type of
behavior will be encountered as progress continues to be made in developing
ATP mimics as therapeutic agents.
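
The competition argument made above can be illustrated with the Cheng-Prusoff relation for a purely competitive inhibitor, IC50 = Ki (1 + [S]/Km). The following Python sketch is not taken from the cited studies. The millimolar ATP level and the 0.1 µmol/l ligand level come from the text; the Ki of 10 nM, the Km of 10 µmol/l for ATP and the Kd of 1 µmol/l for the endogenous receptor ligand are hypothetical values chosen only to show the size of the effect.

```python
def apparent_ic50(ki: float, competitor_conc: float, competitor_km: float) -> float:
    """Cheng-Prusoff relation for a purely competitive inhibitor.

    ki is returned scaled by (1 + [S]/Km); competitor_conc and competitor_km
    must share one unit, and the result has the unit of ki.
    """
    if ki <= 0 or competitor_km <= 0 or competitor_conc < 0:
        raise ValueError("ki and Km must be positive, concentration non-negative")
    return ki * (1.0 + competitor_conc / competitor_km)


if __name__ == "__main__":
    ki_nm = 10.0  # hypothetical nanomolar inhibitor

    # ATP mimic: 1 mmol/l intracellular ATP (from the text), Km(ATP) of
    # 10 µmol/l (assumed for illustration).
    atp_case = apparent_ic50(ki_nm, competitor_conc=1000.0, competitor_km=10.0)

    # Receptor blocker: 0.1 µmol/l endogenous ligand (from the text), ligand
    # Kd of 1 µmol/l (assumed for illustration).
    blocker_case = apparent_ic50(ki_nm, competitor_conc=0.1, competitor_km=1.0)

    print(f"ATP mimic:        apparent IC50 = {atp_case:.0f} nM")
    print(f"Receptor blocker: apparent IC50 = {blocker_case:.0f} nM")
```

Under these assumed constants the ATP mimic loses about two orders of magnitude in apparent potency, while the receptor blocker is almost unaffected, which is the qualitative point the paragraph makes. The relation does not apply to substrate-competitive or mixed inhibitors.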

5 

Screening for Tyrosine Kinase Inhibitors 

The involvement of several tyrosine kinases in various cancers requires efficient
screening methodologies for the inhibitory compounds. Screening is divided
into four steps. (1) Primary screening is performed against the pure isolated PTK
in a cell-free system. The aim is always to use an ELISA format. The compounds are
screened against a battery of PTKs and Ser/Thr kinases in order that the pattern of
selectivity can be established quickly [2].

Successful candidates that show high affinity towards the isolated PTK move
to (2) secondary screening, in which the compound is tested for its potency in
inhibiting the PTK in the intact cell. Many compounds fail at this stage either
because they are not permeable or do not reach their target in the intact cell. 
Compounds that pass this test are then examined against other kinases within 
the intact cell context. This screening is also conducted in a 96 well format and 
is therefore amenable to high throughput screening. Successful compounds are 
tested in (3) tertiary screening for their ability to block the growth of the cell 
when such growth is driven by the targeted PTK. This test is in many cases 
accompanied by an additional test, which examines the ability of the compound 
to block the growth of these cells on soft agar. Such potency indicates that these 
compounds could be successful in vivo. Successful compounds undergo (4) 
quaternary screening through testing in animal models. Animal models range 
from SCID mice or nude mice harboring the human disease to the inhibition of 
balloon injury induced stenosis in pigs as a model for restenosis. 
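
The four-stage cascade described above behaves like a funnel in which a compound is dropped at the first stage it fails. The Python sketch below is a schematic illustration only: the `Compound` fields, the numeric cut-offs and the example compounds are hypothetical and do not correspond to criteria or data given in the text.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass
class Compound:
    """Assay read-outs for one candidate; all values are hypothetical."""
    name: str
    ic50_isolated_nm: float   # cell-free assay against the purified PTK
    ic50_cell_nm: float       # inhibition of the PTK in the intact cell
    blocks_growth: bool       # growth of PTK-driven cells, incl. soft agar
    active_in_animal: bool    # efficacy in an animal model


# Ordered stages; the numeric cut-offs are illustrative assumptions only.
STAGES: List[Tuple[str, Callable[[Compound], bool]]] = [
    ("primary", lambda c: c.ic50_isolated_nm <= 100.0),
    ("secondary", lambda c: c.ic50_cell_nm <= 1000.0),
    ("tertiary", lambda c: c.blocks_growth),
    ("quaternary", lambda c: c.active_in_animal),
]


def first_failed_stage(compound: Compound) -> str:
    """Return the first stage the compound fails, or 'passed' if none."""
    for stage_name, passes in STAGES:
        if not passes(compound):
            return stage_name
    return "passed"


if __name__ == "__main__":
    candidates = [
        Compound("cmpd-A", 5.0, 50.0, True, True),
        Compound("cmpd-B", 5.0, 50000.0, False, False),  # e.g. not cell-permeable
    ]
    for c in candidates:
        print(c.name, "->", first_failed_stage(c))
```

Ordering the stages from cheapest to most expensive mirrors the text: the high-throughput 96-well steps come first and animal models last.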

Due to the availability of chemical libraries, plant and microbial extracts, and
rapid screening methods, lead compounds are rapidly identified. When the
structures of the PTKs are known, optimization by organic synthesis and computer
modeling follows. Compounds that successfully pass all the screening tests are
ready to be evaluated for clinical trials.

6 

Biological Activity of Tyrphostins 

A number of families of PTK inhibitors (tyrphostins) have shown efficacy in
tissue culture as well as in vivo. A significant number of benzene malononitriles
(BMNs), which belong to the founding family of PTK inhibitors, were found to
possess biological activity. These include AG 490, AG 126 and AG 556, for which
promising in vivo data already exist [10, 11, 34-37].

Anilidophthalimides, which inhibit EGFR [41], also show efficacy in vivo.
Pyrido pyrimidines like CGP 53716, which inhibit PDGFR [42], have shown
efficacy in vivo. A similar compound, CGP 57148B, from the pyrazolo pyrimidines,
blocks Bcr-Abl and shows efficacy in intact cells harboring Bcr-Abl [43].
However, this inhibitor also blocks PDGFR and Kit. AG 957, a substrate competitive
tyrphostin derived from Lavendustin A, is very selective against Bcr-Abl kinase
[8, 44] and has been shown to selectively purge Ph+ cells from blood obtained
from CML patients at the chronic phase of the disease [45]. Quinoxalines were
found to be very selective PDGFR kinase inhibitors with very high efficacy in
intact cells [19, 20]. The quinoxaline AG 1295 has also shown efficacy in vivo
by inhibiting balloon injury induced stenosis in pigs ([46] and see below), a
predictive animal model for restenosis in humans. In vivo activity of AG 1295
was also shown in the rat model [47]. Quinazolines show good efficacy against
the EGFR [7, 13, 48-50] and one of them, AG 1517 [13, 49], which is identical to
PD 153035 [48], is currently in clinical trials for psoriasis [7]. This compound is
very effective in blocking the growth of psoriatic keratinocytes [7]. Another
quinazoline, ZD 1839 (Table 3), developed by Zeneca, is currently in clinical trials.
A related pharmacophore (see Table 2) which has been shown to be effective
against the EGFR kinase is derived from 4-(phenylamino)pyrrolopyrimidine
[51], a compound which was rationally designed on the basis of successful
inhibitors from the dianilinophthalimide family. Yet another successful family of EGFR
kinase inhibitors was obtained by the fusion of the quinazoline moiety with a
third ring [52].

7 

The Synergistic Action of Tyrphostins with Cytotoxic Agents,
Antibodies and Cytokines

The enhanced activity of PTKs generates anti-apoptotic robustness in cancer
cells. This is most probably due to the enhancement of anti-apoptotic pathways
mediated by Bcl-X/Bcl-2 and c-Akt/PKB as well as by other biochemical elements.
Intensification of anti-apoptotic pathways seems to be the predominant factor
in the emergence of resistance to cytotoxic drugs and radiation. It is probable
that these inhibitory pathways suppress the stress pathways triggered by
cytotoxic agents, thus preventing cell death. This principle has been demonstrated in
two instances and the molecular mechanism has been at least partially analyzed.
Tyrphostin AG 825 was shown to strongly synergize with cis-platin, etoposide
and doxorubicin, all DNA damaging agents triggering the stress pathways [12].
A series of non-small cell lung cancer (NSCLC) cell lines with different levels
of HER-2/neu overexpression was examined for this effect.

The extent of synergy was found to be dependent on the level of expression of
the HER-2/neu receptor. While only additivity, and no synergy, was observed
for cells with no overexpression of HER-2/neu, the highest synergy
was found in cells with very high expression of HER-2/neu. This was reflected
quantitatively in the combination index (CI), whose value was much lower than
1.0 for the high overexpressors and equal to 1.0 for the cells in which HER-2/neu
is not overexpressed [12]. It seems that the higher the overexpression of HER-2,
the stronger the anti-apoptotic shield provided by the receptor tyrosine kinase. It
would seem, however, that while the sensitivity of the cells to stress signaling is
also enhanced upon higher expression of HER-2/neu, this sensitivity is masked
by the high expression of HER-2/neu. Thus, when the cells are stripped of the
robust anti-apoptotic shield, their enhanced vulnerability to stress is exposed.
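
The reading of the combination index given above (values below 1.0 indicating synergy, a value of 1.0 indicating additivity) can be illustrated with a minimal sketch. It assumes the commonly used Loewe-additivity form of the index, CI = d1/Dx1 + d2/Dx2; the text does not state which formula was used in [12]. The function names, the tolerance band, and all doses are hypothetical and are not data from the study.

```python
def combination_index(d1, d2, dx1, dx2):
    """Loewe-additivity combination index for a two-drug mixture.

    d1, d2  : doses of drug 1 and drug 2 that together reach a given effect
    dx1, dx2: doses of each drug alone that reach the same effect
    (assumed definition; not confirmed by the source text)
    """
    if dx1 <= 0 or dx2 <= 0:
        raise ValueError("single-agent doses must be positive")
    return d1 / dx1 + d2 / dx2


def classify(ci, tol=0.05):
    """Label a CI value; the tolerance band is an arbitrary illustrative choice."""
    if ci < 1.0 - tol:
        return "synergy"
    if ci > 1.0 + tol:
        return "antagonism"
    return "additivity"


# Hypothetical doses in arbitrary units, chosen only to show both outcomes.
high_expressor = combination_index(d1=1.0, d2=2.0, dx1=5.0, dx2=10.0)
non_expressor = combination_index(d1=2.5, d2=5.0, dx1=5.0, dx2=10.0)
print(classify(high_expressor), classify(non_expressor))
```

In this sketch the first cell line needs only a fifth of each single-agent dose when the drugs are combined, giving an index well below 1.0, whereas the second needs half of each, giving exactly 1.0.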

A very similar result was obtained recently with human glioma cells, which
overexpress the truncated form of the EGF receptor, Δ(2-7)-EGFR. In this case
EGFR kinase selective inhibitors such as AG 1478 or AG 1517 synergize
with cis-platin (CDDP). It has been demonstrated that AG 1478 reduces the level
of expression of Bcl-XL and increases the activity of caspase 3. The combination
of the tyrphostin with cis-platin is lethal to the tumor cells since it leads to
apoptosis [13]. It seems therefore that ΔEGFR transmits strong anti-apoptotic
signals whose inhibition sensitizes the cells to the stress signaling triggered
by cis-platin. Such a success would allow clinical application of these tyrphostins
as sensitizers of these tumors to cytotoxic drugs, to which they are normally
insensitive. Indeed, the Ludwig Institute for Cancer Research (LICR) has
recently begun preparations to treat patients with glioma multiformis with the
combination of AG 1478 and CDDP.

The combinatorial approach, which combines a blocker of anti-apoptotic
signaling with a stimulator of stress signaling, could prove to be a general approach.
It is therefore probable that blockers of Bcl-2/Bcl-XL proteins and c-Akt/PKB
will have a broad application for cancer therapy when they become available.
However, since PTKs, especially in tumors, are frequently, if not always, the
source of anti-apoptotic signals, tyrphostins may actually fulfill this task. In
some cases such as Pre-B ALL the selective tyrphostin AG 490 was found to be
sufficient by itself to induce apoptosis [10]. It seems that in this case stress
signaling is so strong that stripping the cells of the anti-apoptotic signals
is sufficient to induce apoptosis. This point, however, is still under investigation.
In IL-6 dependent multiple myeloma cells, significant cell death can be attained
by the Jak-2 inhibitor AG 490, but its effect is enhanced dramatically by the Fas
receptor antibody CH-11 [53]. Similarly, in CML one finds very significant
synergy between the Bcr-Abl kinase inhibitor AG 957 and the Fas receptor
antibody CH-11 in selectively purging Ph+ cells [45].

The combination of tyrphostins with cytotoxic drugs can be followed by 
immunotherapy in order to eliminate residual disease. Though such combinations 
have not yet been examined, the combination of AG 490 and IL-12 against IL-6 
dependent multiple myeloma recently showed impressive tumor suppressive 
effects [54], suggesting that the general idea may indeed be correct. 

8 

Success of Tyrphostins In Vivo 

In many cases the hypothesis that PTK inhibitors provide an effective block to 
proliferative disease or tumor growth in vivo has in fact been proven in pre- 
clinical animal models. Such cases are listed below. 



8.1 

PDGFR Kinase Directed PTK Inhibitors 

Tyrphostin AG 1295 and its close analog AG 1296, both quinoxalines (see 
Table 2), have been shown to selectively block PDGFR kinase with insignificant 
inhibitory effects on EGFR, Src, Flk-1, HER-2/neu and IGF1-R. These tyrphostins
have also been shown to reverse the transformed phenotype of sis-transformed
NIH 3T3 cells [19] and slow C6 glioma induced tumors in nude mice (Shawver,
L., unpublished). In recent experiments on pigs, AG 1295 was shown to block
balloon injury induced stenosis in the femoral artery [46]. The extent of
inhibition is ~50-60% compared to the treatment of the balloon-injured femoral
artery with the vehicle alone. Similar results were obtained for rats. The pig model,
which is much more faithful to the human situation and a better predictor
for humans, tends to indicate AG 1295 and its more potent analogs AGL 2034 and
AGL 2044 (Gazit, Banai and Levitzki, in preparation) as candidates for clinical
testing. A quinoxaline developed by Rhone-Poulenc-Rorer (RPR) has also shown
efficacy in inhibiting stenosis in balloon injured minipigs fed on a high fat diet
[55]. The ability of very selective PDGFR kinase inhibitors to block balloon
injury-induced stenosis supports Russell Ross’ (1976) “PDGF hypothesis” [56, 57]
which implicated this growth factor and its receptor in the process of
atherosclerosis and its accelerated form, restenosis. The finding that PDGFR kinase
directed tyrphostins block ~60% of stenosis strongly suggests that, in order to
block the remaining proliferative signals, other signaling pathways should also be targeted.
These include those signaling pathways elicited by FGF and TGFα. Since
accelerated atherosclerosis in transplanted hearts is the major cause of death
among heart transplant patients, AG 1295 and its analogs are currently being
tested in an animal heart transplant model with very promising results (Karck
and Levitzki, unpublished experiments).



8.2 

Jak-2 Inhibitor 

Among the many tyrphostins, it has been shown that the AG 490 family are 
potent inhibitors of Jak-2 and to a lesser extent Jak-3 [58]. The high potency of 
AG 490 in blocking Jak-2 may well explain these agents’ ability to block recurrent 
pre-B acute lymphoblastic leukemia (Pre-B ALL) when engrafted in SCID mice
[10]. AG 490 induces apoptotic death in Pre-B cells taken from patients suffering 
from this disease. Jak-2 is persistently activated in these Pre-B cells whereas in 
normal B cells this is not the case. It is still not known whether the persistent 
activation of Jak-2 is due to an activating mutation or to a robust autocrine 
stimulation. The efficacy of AG 490 in vivo accompanied by a complete absence 
of toxicity in normal blood tissue is gratifying [10]. This agent is currently 
undergoing evaluation for clinical trials. AG 490 also induces apoptosis in IL-6 
dependent multiple myeloma [53] and synergizes with the Fas antibody CH-11
and with IL-12 to induce the eradication of the IL-6 dependent tumor [54]. 



8.3 

EGFR Kinase Inhibitors 

The efficacy of EGFR kinase blockers from the early tyrphostin family was
demonstrated in 1991. They were the first PTK blockers to show efficacy in vivo [11].
Due to their unfavorable pharmacokinetic properties, however, they did not 
make further progress in pre-clinical animal models and were later abandoned 
as possible candidates for the clinical setting. These experiments nonetheless
constituted a milestone in the sense that they provided proof of principle.
These experiments showed for the first time that the activity of a tyrphostin 
identified in a cell-free system resulted in in vivo success. A few heterocyclic 
EGFR kinase inhibitors such as DAPH [41] and CGP 59326 A [59] have shown 
efficacy in vivo against tumors that overexpress EGFR. Furthermore, the combina- 
tion of this inhibitor with cytotoxic drugs such as adriamycin and vinblastine 
induced tumor regression. One quinazoline, AG 1517 (SU 5271), is currently 
being tested as an anti-psoriatic agent using topical application. Since no animal 
model is available for psoriasis, the potency of AG 1517 (SU 5271) in inhibiting 
the growth of psoriatic cells and the absence of adverse toxic effects has enabled 
the beginning of clinical trials. Since not only psoriatic cells [6, 7] but also
Papilloma (HPV 16) infected keratinocytes are driven by the EGFR-kinase
system [4], success may lead to the testing of EGFR-directed PTK inhibitors as
anti-Papilloma agents [4]. As with psoriasis, there is no accepted animal model
for Papilloma. It is therefore likely that EGFR kinase inhibitors will go straight 
to the clinic, skipping the pre-clinical animal model. In the case of Papilloma, 
topical application is intended to prevent the transformation of the HPV 16 
infected tissue into cervical cancer [4]. 

A quinazoline developed by Zeneca, ZD 1839 (Table 3), has shown efficacy in
the clinic against EGFR overexpressing tumors. AG 1478 [3, 29] is effective
against glioma multiformis in cells [13] and in vivo (Cavenee W., unpublished
experiments) only in combination with the cytotoxic drug CDDP (cis-platin).
In 50% of these tumors the EGFR is truncated and is persistently active. We
have shown that this truncated receptor emits very strong anti-apoptotic signals
which result in the tumor’s drug resistance. Upon blockade of the truncated
receptor by AG 1478, the tumor is re-sensitized to the drug and its growth is
severely inhibited. Therefore, the combination of AG 1478 and CDDP is currently
being developed for clinical trial for patients with glioma multiformis.



8.4 

Tyrphostins as Anti-Inflammatory Agents 

A number of tyrphostins such as AG 126 [34, 37] and AG 556 [34-37] have shown
high efficacy as anti-inflammatory agents. The most prominent is AG 556, which
was found to be effective as an anti-sepsis agent in mice [36, 37] and dogs [35] and
was very effective in alleviating the symptoms of experimental autoimmune
encephalitis, even with late administration [60]. AG 126 was found to be
effective in preventing LPS-induced mortality in mice [37] and to alleviate
cirrhosis-like symptoms in rats [34]. The search for such compounds within the tyrphostin
family was based on scattered reports in the literature that the activation of
PTKs is on the pathway of the LPS-induced inflammatory response. Similarly,
scattered reports suggest that the action of TNFα involves PTK activation at some point
on the overall signaling network activated by the cytokine (see, for example,
[61-65]). Since no specific molecular target has been defined, screening for
active tyrphostins was based on the potency of compounds tested to block
LPS-induced TNFα production and TNFα action [37]. It remains to be seen whether
the biological activity of this family of tyrphostins is exclusively due to the
inhibition of PTKs.

Within the last few years, therapeutic intervention into signal transduction cascades has attracted
the interest of pharmaceutical research. Apart from selective inhibition of phosphorylating 
and dephosphorylating enzymes, medicinal chemistry has focused on an SH2 domain-target- 
ed drug design approach. This review will briefly summarize the role of adaptor proteins in 
signaling networks, followed by an introduction on the functional and structural details of 
SH2 domains in terms of protein architecture and mode of ligand binding. Major emphasis is 
laid on medicinal chemistry programs that aim to develop peptidomimetic SH2 domain inhibi- 
tors. Different strategies for phosphotyrosine-replacement will be presented, followed by a 
detailed discussion of the iterative optimization cycles revealing new peptidomimetic and 
non-peptide SH2 domain antagonists. Finally, an outlook is given on further applications of 
the modular chemistry concept and the privileged building blocks, that have been developed 
in SH2 domain inhibitor programs, onto a new class of adaptor proteins, notably the PTB 
domains.

List of Abbreviations

Abu      α-aminobutyric acid
Ac6c     1-amino-cyclohexanecarboxylic acid
Achec    2-amino-cyclohex-3-enecarboxylic acid
Aib      α-amino-isobutyric acid
APP      amyloid precursor protein
CARD     caspase recruitment domain
chmF     4-(carboxyhydroxymethyl)phenylalanine
cmF      4-(carboxymethyl)phenylalanine
cmT      4-(carboxymethyl)tyrosine
DEATH    domains involved in cell death (apoptosis)
F2cmF    4-(carboxydifluoromethyl)phenylalanine
FOMT     O-(2-(2-fluoromalonyl))tyrosine
FPmp     4-(phosphonofluoromethyl)phenylalanine
F2Pmp    4-(phosphonodifluoromethyl)phenylalanine
F4Pmp    2,3,5,6-tetrafluoro-4-(phosphonomethyl)phenylalanine
Hcy      homocyclohexylalanine
HGF      hepatocyte growth factor
HPmp     4-(phosphonohydroxymethyl)phenylalanine
IRS-1    insulin receptor substrate 1
ITAM     immune receptor tyrosine activation motif
Jak      Janus kinase
MDCK     Madin-Darby canine kidney
OMT      O-(2-malonyl)tyrosine
PDGF     platelet-derived growth factor
PDZ      domain found in PSD-95, Dlg, and ZO-1/2 proteins
PH       pleckstrin homology domain
PI       phosphotyrosine-interaction domain
Pmf      p-(2-malonyl)phenylalanine
Pmp      4-(phosphonomethyl)phenylalanine
PTB      phosphotyrosine binding domain
PTK      protein tyrosine kinase
PTP      protein tyrosine phosphatase
pTyr     phosphotyrosine
RTK      receptor tyrosine kinase
SF       scatter factor
SH2      src homology 2 domain
SH3      src homology 3 domain
SMART    simple modular architecture research tool
Stat     signal transducers and activators of transcription

1 

Introduction 

Over the past few years, novel opportunities for pharmaceutical research have
emerged from a deeper understanding of intracellular signal transduction pathways
together with the molecular components and intermolecular interaction
events involved [1-6]. By revealing the complex networks that generally underlie the
extracellular ligand-induced regulation of cell function, numerous novel
molecular targets have been suggested for therapeutic intervention in signal
transduction-related disorders. Diseases resulting from aberrant signal
transduction processes range from cancers, atherosclerosis, or psoriasis as examples
of proliferative disorders, through inflammatory conditions such as sepsis,
rheumatoid arthritis, multiple sclerosis, or tissue rejection, to various immunological
disease states and metabolic disorders [1, 4, 6]. Consequently, current medicinal
chemistry is in the process of refocusing its efforts in seeking new approaches to
disease management by controlled intervention into signal transduction
pathways. Apart from the more classical approach of medicinal chemistry to
interfere in signal transduction cascades by low-molecular weight enzyme
inhibitors of, e.g., receptor tyrosine kinases (RTKs), cellular tyrosine kinases,
farnesyl transferases, or phospholipases, the controlled modulation of
non-catalytic adaptor protein domains emerges as a novel research area [2, 5, 7].

Signaling cascades are choreographed to a large degree by exactly those
non-catalytic signaling domains such as SH2, SH3, PTB, PDZ, PH, WW, CARD,
DEATH, 14-3-3, etc., which mediate the propagation of a receptor activation
event through the cytoplasm by establishing highly specific protein-protein
interactions [7-10]. In this context, phosphorylation of particular tyrosine
residues serves as an on-off switch for, e.g., SH2 domain binding, while the
sequential peptide environment around a phosphotyrosine (pTyr) confers a secondary
level of specificity. Therefore, the pTyr determines a critical pharmacophore in
cellular signaling [7, 11]. Its inappropriate expression, either through generation
by protein tyrosine kinases (PTKs) or cleavage by protein tyrosine
phosphatases (PTPs), contributes to the aforementioned diseases. Agents that selectively
modulate pTyr-based molecular recognition events through interference into,
e.g., the SH2 domain-mediated protein-protein interactions are not only
considered as useful biochemical tools, but are believed to yield new medicines.

The design of such low-molecular weight compounds that inhibit or mimic
the action of a naturally occurring protein epitope or peptide ligand is a
continuing challenge for modern medicinal chemistry [12-15]. Peptides generally
lack the appropriate physicochemical properties and metabolic stability to be
useful therapeutic agents. This has led to the well-established concept of
peptidomimetics that comprises a stepwise de-peptidization approach based on the
knowledge of the structural requirements that govern the corresponding
ligand-target binding event.

This contribution will summarize the currently pursued approaches towards 
non-peptide SH2 domain antagonists and the development of promising lead 
structures for SH2 domain-associated disease processes. It is clearly beyond the 
scope of this review to provide an in-depth overview of the entire research area. 

After introducing functional and structural aspects of SH2 domains and the 
mode of ligand binding, special emphasis will be laid on lead finding attempts 
that utilize a modular, peptidomimetic-based design concept for the develop- 
ment of non-peptide SH2 domain inhibitors, rather than discussing any projects 
that remain in the realm of linear or cyclic oligopeptides. The corresponding 
section will be divided into two distinct parts: first pTyr mimics are introduced, 
followed by the discussion of promising oligopeptide mimetics. Instead of 
listing the distinct research programs pursued in different research institutes 
chronologically, a conceptual classification of the lead structures of different
generations is preferred, thus accounting for the steady decrease in peptide char- 
acter achieved throughout an iterative optimization process. This should allow 
the reader to appreciate fully the distinct steps that finally led to a significant 
structural departure from the original peptide lead sequence. Finally, an oppor- 
tunity for extrapolating the elaborated peptidomimetic building blocks as well 
as the modular chemistry concepts from the SH2 domain target family onto a 
second class of adaptor proteins, notably the PTB domains, will be highlighted. 

Even though adaptor proteins do not function as classical receptor proteins 
or catalytically active enzymes, the terms “antagonist” and “inhibitor” will be 
used interchangeably throughout the text for reasons of convenience. 

2 

SH2 Domains 



2.1 

Adaptor Proteins in Signal Transduction 

A signal is defined as “any event or action that causes some general activity” 
[16]. According to that definition, the function and activity of cells is primarily 
controlled by external signals that modulate intracellular events upon binding 
to transmembrane cell surface receptors. The subsequent activation of the cell 
surface receptor by extracellular factors is transduced over the cellular mem- 
brane into the cytoplasm, causing the intracellular receptor domains to establish 
contact with “downstream” signal transducing proteins. This initial intracellular 
protein-protein interaction event then stimulates a cascade of non-catalytic as 
well as catalytically modifying protein-protein contacts, by which the biological 
signal is disseminated throughout the cell along multiple intracellular pathways. 
Most generally, the term signal transduction refers to a controlled flow of
biological information by a sequential or parallel engagement of molecular
regulators with the aim of eliciting a cellular response [17].

The biochemical paradigm of signal transduction pathways is the ordered 
alteration of the phosphorylation state of surface-exposed tyrosine (Tyr), threo- 
nine (Thr), or serine (Ser) residues on target proteins. Thus, the molecular re- 
cognition of phosphorylated proteins, together with enzymatic phosphorylation 
and dephosphorylation is the primary mechanism of cellular signal transduc- 
tion. Apart from kinases and phosphatases, numerous cytoplasmic proteins 
involved in signal transduction pathways are not active in the classical fashion 
of enzymes. These non-catalytic adaptor proteins or protein domains confer a 
remarkable specificity to the distinct signal transduction pathways in which 
they recruit enzymes into signaling networks by establishing spatial proximity 
between the biochemical reaction partners [18]. This fundamental process of 
scaffolding and anchoring pursued by adaptor proteins governs where and 
when corresponding kinases, phosphatases, and other enzymes involved are 
activated. In general, the enzymatic activity of a catalytically active signaling 
protein is encoded on a distinct protein domain separate from its adaptor
region. These binding regions are localized on modular protein domains 
demonstrating a core recognition ability which is conserved throughout the 
entire homology family a particular adaptor module belongs to. By the repeated 
use of adaptor proteins and the associated “lock-and-key” recognition events, a 
complex and diverse regulatory network of molecular interactions can be 
assembled [18]. In several cases multiple and different adaptor protein domains 
and enzymes are encoded on a single polypeptide chain, thus allowing a single 
multiple-domain protein to bind multiple targets. Within signaling cascades, 
this is used to couple activated receptors to several downstream targets and 
associated pathways or, alternatively, to increase affinity and specificity by 
simultaneously employing more than one adaptor for a single binding event 
[18]. 

Consequently, nature utilizes a combination of adaptor protein domains to
ensure specificity by forming signaling complexes. The adaptor protein-mediated
assembly of signaling components into pathways is typified by the
association of phosphorylated or autophosphorylated receptor tyrosine kinases (RTKs)
with cytoplasmic multi-domain proteins. Currently, numerous adaptor protein
modules are known to be involved in orchestrating the correct repertoires of
enzymes into individual signal transduction pathways [8-10], the most
prominent representatives being the SH2 [19, 20], SH3 [21-25], PTB [26-28], PDZ
[29-32], DEATH [33-36], WW [37-41], or 14-3-3 [42-44] domains. As of 18
March 2000 the SMART (simple modular architecture research tool) database
at the EMBL in Heidelberg [9, 10] annotated the remarkably large number of, e.g.,
154 distinct SH2 domains in 143 human signaling proteins, 359 SH3 domains in
268 distinct human proteins, 55 PTB domains in 51 human proteins, 282 PDZ
domains in 157 human proteins, 79 WW domains in 45 human proteins, and 10
14-3-3 domains in 10 human proteins. This impressive list of currently known
signaling domains together with associated background information on
functional and structural aspects can be accessed via the World Wide Web under
http://smart.embl-heidelberg.de/smart/domain-table.cgi [9, 10].
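
The domain and protein counts quoted above already hint at how often a given adaptor module occurs in more than one copy per protein. As a minimal sketch, the average number of domains per protein can be computed directly from the figures in the text; the dictionary layout and function name are illustrative choices and not part of the SMART interface.

```python
# (domains, proteins) as quoted in the text for SMART, 18 March 2000
smart_counts = {
    "SH2": (154, 143),
    "SH3": (359, 268),
    "PTB": (55, 51),
    "PDZ": (282, 157),
    "WW": (79, 45),
    "14-3-3": (10, 10),
}


def domains_per_protein(counts):
    """Average number of annotated domains per protein for each domain family."""
    return {name: domains / proteins for name, (domains, proteins) in counts.items()}


ratios = domains_per_protein(smart_counts)

# List the families from the most to the least frequently repeated module.
for name, ratio in sorted(ratios.items(), key=lambda item: item[1], reverse=True):
    print(f"{name:7s} {ratio:.2f}")
```

On these numbers PDZ and WW domains are repeated most often within a single protein, whereas SH2, PTB, and 14-3-3 modules occur close to once per protein on average.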

2.2

Function of SH2 Domains 

Cellular signal transduction is initiated by the extracellular binding of an
agonist to its receptor which is followed by phosphorylation of regulatory tyrosines
on the cytoplasmic receptor domains. Phosphorylation is achieved either by
receptor tyrosine kinases (RTKs) which are an integral part of the receptor protein
itself, or by non-receptor tyrosine kinases that are recruited to the receptor
protein from the cytoplasm. This cytoplasmic receptor autophosphorylation
or phosphorylation acts as a switch to induce physical association between
the activated receptor and the adaptor protein modules. Most generally, an
adaptor is defined as a “device that connects pieces of equipment that were not
originally designed to be connected” [16]. SH2 domains act as an adaptor
by connecting activated receptors with downstream signaling proteins such
as kinases, phosphatases, phospholipases, or GTPases, respectively (Fig. 1)
[45-47]. The SH2 domain was first identified on the oncoproteins Src [48] and
Fps [19]. Functionally, SH2 domain-containing signaling proteins can be
classified in two groups, multi-domain enzymes and catalytically inactive adaptors.
The family of enzymatically active SH2-containing proteins comprises Src, Lyn,
Hck, Syk, ZAP70, Syp, Btk, Tec, Abl, Itk, Csk, PLCγ, and GAP. Among the
enzymatically inactive adaptor proteins are Grb2, Grb7, p85α, Shc, Nck, and Tensin
(Fig. 1).

Fig. 1. Schematic representation of the modular architecture of selected pharmaceutically
relevant SH2 domain-containing targets involved in signal transduction. For further details
refer to http://smart.embl-heidelberg.de [9, 10]

SH2 domains specifically recognize phosphorylated tyrosine (pTyr) residues 
on cytoplasmic receptor domains or soluble signaling components [49-51]. 
The pTyr-flanking amino acid residues constitute the SH2 domain docking
site and determine which SH2-containing signaling proteins associate with 
the phosphorylated protein surface [52-56]. Multiple pTyrs on, e.g., a single 
cytoplasmic receptor domain can recruit numerous different SH2 domain pro- 
teins as exemplified for the PDGF (platelet-derived growth factor) receptor 
(Fig. 2) that attracts 8 different SH2 domain proteins in 12 distinct binding 
events. 

Upon recognizing a pTyr-presenting ligand, SH2 domain proteins can be
phosphorylated, thus recruiting other signal transducers to the activated
signaling complex, or altering the intrinsic enzymatic activity of an SH2-containing
enzyme. Apart from the pure adaptor function and the enzyme activation, SH2
domain proteins have been shown to translocate to the cell nucleus where they
regulate gene transcription upon DNA binding. The Jak-Stat pathway (Jak: Janus
kinase; Stat: signal transducers and activators of transcription) (Fig. 3) is an
illustrative example for a straightforward signal transduction cascade involving
only a few distinct steps [57-59]. The non-receptor tyrosine kinase Jak
associates with an activated transmembrane receptor, e.g., the IL-6 receptor, and
subsequently phosphorylates members of the Stat protein family. Due to the
occurrence of SH2 domains in Stat proteins, they associate into dimers, stabilized
by the reciprocal interaction of the SH2 domain of one Stat protein with the pTyr
ligand epitope on the opposing Stat protein and vice versa. The dimers
translocate from the cytoplasm to the nucleus where they specifically bind to DNA
and activate gene transcription (Fig. 3) [57-59].

Fig. 2. Presentation of the intracellular domain of the PDGF (platelet-derived growth factor)
receptor. The two catalytically active kinase domains are highlighted. Docking sites for SH2
domains are symbolized with a pTyr sidechain. SH2 domains identified to bind to the
corresponding phosphorylated tyrosines (numbered on the left) are schematically depicted and
labeled on the right

Fig. 3. IL-6 receptor-mediated Jak-Stat pathway [57-59]

The potential binding sites for various SH2 domain proteins on cytoplasmic 
receptor domains as well as on soluble proteins have been precisely mapped for 
several pTyr-containing target proteins. Apart from the impressive diversity of 
SH2 domain function, the fundamental property of all SH2 domains refers to the 
specific recognition of pTyr epitopes.



2.3

Structure of SH2 Domains 

SH2 domains generally consist of approximately 100 amino acid residues [49, 50,
60, 61] and were first identified as a conserved sequence region between the
oncoproteins Src and Fps [19, 48]. By means of sequence homology, SH2
domains have been uncovered in numerous other intracellular signal
transduction proteins (Figs. 1 and 4) [20].

Due to the ready accessibility of SH2 domains by molecular biology
techniques, numerous experimentally determined 3D structures of SH2 domains
derived by X-ray crystallography as well as heteronuclear multidimensional
NMR spectroscopy are known today. The current version of the protein
structure database, accessible to the scientific community by, e.g., the Internet
(http://www.rcsb.org/pdb/), contains around 80 entries of SH2 domain
structures and complexes thereof. Today, the SH2 domain structures of Hck [62],
Src [63-66], Abl [67], Grb2 [68-71], Syp [72], PLCγ [73], Fyn [74], SAP [75],
Lck [76, 77], the C- and N-terminal SH2 domains of p85α [78-80], and of the
tandem SH2 domains of Syk [81, 82], ZAP70 [83, 84], and SHP-2 [85] have been determined.
All SH2 domains display a conserved 3D structure as can be expected from
multiple sequence alignments (Fig. 4). The common structural fold consists of
a central three-stranded antiparallel β sheet that is occasionally extended by
one to three additional short strands (Fig. 5). This central β sheet forms the
spine of the domain which is flanked on both sides by regular α helices [49,
50, 60].

Fig. 4. Multiple sequence alignment of six distinct SH2 domain sequences. The SH2 domain
identifiers are given on the left, the bottom line represents the consensus sequence as analyzed
from the alignment. Highly conserved sequence positions are marked with capital letters. The
sequential positions of the major secondary structure elements (helices and sheets) are
indicated by boxes

Fig. 5. Side-by-side stereo presentation of a ribbon diagram of the Lck SH2 domain, Protein
Databank entry code: 1BHF.PDB [118]



In most SH2 domains the peptide ligands bind in an extended backbone
conformation towards the edge strand (strand 3) of the central β sheet in an
orthogonal orientation. The protein part accommodating the N-terminal α helix
is primarily concerned with providing the binding determinant for the pTyr
residue within peptide or protein ligands, whereas the opposite compartment
formed by the C-terminal α helix provides interaction partners for ligand
residues downstream of pTyr. The protein architecture is tailor-made for an
“end-on” recognition mode, since both termini of SH2 domains are located
opposite to the ligand binding site, thus not affecting ligand recognition upon
covalent N-, C-, or N- and C-terminal incorporation of the SH2 domain into a
multi-domain protein context [49, 50, 60].



2.4 

Mode of Ligand Binding 

The majority of pTyr-containing peptide or protein ligands bind in an extended
conformation [49, 50, 60]. However, significant deviations are observed for, e.g.,
Grb2-peptide complexes, in which the ligand is bound in a tight reverse turn
conformation of type βI [68-70]. In all SH2 domains, the pTyr sidechain
projects into a defined binding pocket, lined with residues capable of forming an
extended network of hydrogen bonds with the phosphate group (Fig. 6). Highly
conserved Arg residues coordinate the phosphate by bidentate hydrogen bonds
reinforced by additional charge-charge interactions.

Apart from the hydrogen bond potential at the bottom of the pTyr pocket, the
phenol ring of pTyr makes a number of hydrophobic contacts with the SH2
domain. A second pronounced pocket, primarily lined with hydrophobic
residues, binds the amino acid sidechain three positions towards the C-terminus
relative to the pTyr residue (pTyr+3). Due to the observation that pTyr peptides
bind to most of the SH2 domains by addressing these two binding pockets
(pTyr and pTyr+3), the binding mode has been described as a “two-pronged plug
engaging a two-holed socket” (Fig. 7) [64].

Fig. 6. Schematic diagram of the peptide-protein interaction mode as seen in the
crystallographically refined structure of the Lck SH2 domain-peptide complex, Protein Databank
entry code: 1LKK.PDB. The residues directly engaged in intramolecular hydrogen bonds
(dotted lines) are labeled explicitly

In a variety of SH2 domains the “two-holed socket” template for binding the
“two-pronged plug” is less pronounced than in others, and in some cases ligand
residues further downstream than pTyr+3 are bound in hydrophobic pockets.
The residues connecting pTyr and pTyr+3 are generally surface exposed with
solvent-accessible sidechain functionalities and, apart from a hydrogen bond
involving the pTyr+1 backbone N-H, no specific direct intermolecular contacts
are conserved [49, 50, 60, 64].

This interaction mechanism, in which the binding is hypothesized to 
resemble a “two-pronged plug” (peptide or protein ligand) inserting into a 
“two-holed socket” (SH2 domain), predicts that high-affinity binding is driven 
predominantly by hydrophobic interactions, determined mainly by the depth of 
insertion of the pTyr+3 residue into its hydrophobic pocket (Figs. 7 and 8). 

However, thermodynamic investigations of this binding event employing, 
e.g., isothermal titration calorimetry revealed the “two-pronged plug, two-holed 
socket” model to be an oversimplification of the true binding mechanism 
[86-97]. No significant differences in the thermodynamic signature were 
detected for the binding of different peptides with sidechains of variable size 
and hydrophobicity at the pTyr+3 position. 

Fig. 7. Side-by-side stereo presentation of the Lck SH2 domain (1BHF.PDB). The protein is 
shown as a ribbon diagram, while the ligand binding site is highlighted by a Connolly surface 
in line-mode. The two interaction-mediating ligand sidechains bind into the “two-holed 
socket”, the binding pockets of which are marked by an asterisk (left, pTyr-binding pocket) 
and a cube (right, pTyr+3 sidechain) 




Fig. 8. Lck SH2 domain-peptide complex (Ac-cmF-Glu-Glu-Ile-OH, 12) revealing the 
“two-pronged plug engaging a two-holed socket” binding mode, reminiscent of the majority of SH2 
domains (Protein Databank entry code: 1BHF.PDB [118]). The protein is depicted in a 
Connolly surface mode, the ligand is given in a ball-and-stick representation. The cmF residue is 
deeply buried in its binding pocket (left) 

In these experiments, only moderate heat capacity changes upon binding of 
high- and low-affinity pTyr peptides to SH2 domains were observed, suggesting 
that high-affinity binding may not be determined solely by a hydrophobic effect 
centered at the pTyr+3 position. In the case of the Src SH2 domain-pTyr-Glu-Glu-Ile 
peptide interaction, it was shown calorimetrically that the pTyr+2 
position, in this case Glu, was nearly as critical for high-affinity binding as the 
Ile located at pTyr+3 [94]. In the light of numerous high-resolution structures, this 
finding is rationalized by a water-mediated hydrogen bonding network emanating 
from the pTyr+2 Glu sidechain to polar and charged surface-exposed 
residues of the Src SH2 domain [95]. 

A combined spectroscopic and calorimetric study of the binding 
mechanism of the tandem SH2 domain of Syk and ITAM (a dually phosphorylated 
peptidic immunoreceptor tyrosine-based activation motif that binds with 
nanomolar affinity) revealed an even more complicated binding mechanism, 
characterized by a ligation-coupled conformational change of the tandem SH2 
domain [97]. The studies further revealed the single SH2 domains to be rather 
rigid locks or sockets when binding their keys or plugs, respectively. No 
remarkable induced-fit effects were observed upon ligand binding [97]. 



2.5 

Specificity of SH2 Domains 

The specificity profile of individual SH2 domains was determined in a series of 
studies employing degenerate phosphopeptide library screens [53, 98] from 
which two consensus motifs emerged for SH2 domain binding [52-55, 99, 100]. 
One group of SH2 domains preferentially binds to a pTyr-Glu-Glu-Ile motif that 
defines a generic recognition sequence: 

pTyr-Xaa-Xaa-Yaa, 

Xaa being hydrophilic and/or negatively charged, and Yaa being bulky and 
hydrophobic. 

The second group of SH2 domain proteins discriminates less and selects 
mainly hydrophobic residues with a consensus sequence: 

pTyr-Yaa-Any-Yaa, 

Yaa being hydrophobic. 
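
The two consensus motifs can be read as simple patterns over the three residues that follow pTyr. The sketch below is a minimal illustration only: it assumes one-letter amino acid codes, and the residue sets chosen for "hydrophilic and/or negatively charged" (Xaa) and "hydrophobic" (Yaa) are hypothetical groupings introduced here, not the result of the library screens [53, 98]. The function name classify_motif is likewise an assumption.

```python
import re

# Assumed residue classes (illustrative only; not taken from the screening data).
HYDROPHILIC = "DENQSTKRH"    # Xaa: hydrophilic and/or negatively charged
HYDROPHOBIC = "AVLIMFWYPC"   # Yaa: hydrophobic

# Patterns describe positions pTyr+1 to pTyr+3; pTyr itself is implied.
GROUP_1 = re.compile(f"[{HYDROPHILIC}]{{2}}[{HYDROPHOBIC}]")  # pTyr-Xaa-Xaa-Yaa
GROUP_2 = re.compile(f"[{HYDROPHOBIC}].[{HYDROPHOBIC}]")      # pTyr-Yaa-Any-Yaa


def classify_motif(downstream: str) -> str:
    """Classify the three residues C-terminal to pTyr (one-letter codes)."""
    tri = downstream[:3].upper()
    if len(tri) < 3:
        raise ValueError("need at least three residues after pTyr")
    if GROUP_1.fullmatch(tri):
        return "group 1 (pTyr-Xaa-Xaa-Yaa)"
    if GROUP_2.fullmatch(tri):
        return "group 2 (pTyr-Yaa-Any-Yaa)"
    return "no consensus match"


print(classify_motif("EEI"))  # residues following pTyr in pTyr-Glu-Glu-Ile
```

With these assumed residue sets, the Glu-Glu-Ile sequence falls into the first group, while a sequence such as Val-Asn-Val would fall into the second.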

Based on these sequence-affinity relationships it was assumed that ligand 
selection is performed by SH2 domain residues strategically lining the 
binding pocket, which interact specifically with pTyr-exposing tetrapeptide 
epitopes on the protein binding partner. Consequently, most SH2 
domain-targeted inhibitor design attempts rely on the assumption that 
non-natural amino acids and peptidomimetic building blocks will permit 
the synthesis of de-peptidized SH2 domain antagonists with improved affinity, 
specificity, stability, and pharmacokinetic properties. 




Fig. 9. Peptidomimetic design principle of SH2 antagonists derived from a fragmentation of 
the native peptide ligand. The lead finding efforts discussed in Sect. 3 are classified according 
to the modules given above the peptide lead sequence 



3 

Peptidomimetic SH2 Domain Antagonists 

Numerous efforts towards the design and synthesis of promising lead structures 
specifically targeting the ligand binding site of distinct SH2 domains have been 
described, mainly within the last five years, highlighting the novelty of this 
research area within current medicinal chemistry (for reviews see [7, 11, 101]). The 
chemistry programs pursued in different research groups in industry and academia 
were immensely stimulated, if not initiated, by the experimentally determined 
3D structures of SH2 domains and their complexes with peptide ligands. 
Since the targeted protein-protein interaction event involves a sequentially 
continuous binding epitope, comprising the pTyr 1 on the SH2 domain ligand side, 
a modular peptidomimetic-based strategy was chosen by most research groups 
(Fig. 9). 

According to the peculiarities of the ligand binding mode and the specificity 
profiles described in Sects. 2.4 and 2.5, respectively, the lead design problem can 
be separated into individual subtopics as shown in Fig. 9. 

Prior to the discussion of biologically active peptidomimetic and non-pep- 
tide SH2 domain antagonists (Sect. 3.2), the following section will introduce a 
variety of attempts towards the replacement of pTyr 1 (Scheme 1). 



3.1 

Phosphotyrosine Mimetics 

The hydrolytic accessibility of the ester bond in pTyr 1 (Scheme 1) to cellular 
protein-tyrosine phosphatases motivated numerous research groups to develop 
phosphatase-resistant bioisosteric analogues. It was soon recognized that the 
phosphate structure itself imparts critical functionality to the tyrosine. 
Therefore the search for phosphate mimics that can be introduced onto, e.g., 
phenylalanine platforms has finally attracted the interest of classical 
peptidomimetic research. In the following five sections (Sects. 3.1.1-3.1.5) 
representative examples will be given that are classified into phosphorus-containing and 
non-phosphorus-based pTyr analogues, the latter being roughly grouped into 
different classes according to the “spatial density” of oxygen atoms. 

Scheme 1. Phosphorus-containing pTyr mimics 

3.1.1 

Phosphonates 

The phosphonate analogue Pmp 2 [4-(phosphonomethyl)phenylalanine] 
(Scheme 1) can be considered one of the closest pTyr mimics, since the only 
structural modification is the replacement of the phosphate ester oxygen by a 
methylene group [102-107]. Pmp-containing peptides generally display decreased 
binding affinities for a specific target SH2 domain when compared to 
the pTyr-containing parent peptide. This was attributed to an increased pKa2 of 
the phosphonate 2 (7.1) relative to pTyr 1 (5.7) and to the loss of hydrogen 
bonding capabilities [63, 108, 109]. A decrease of the corresponding pKa was 
achieved by introducing electron-withdrawing groups onto the methylene 
bridge of Pmp 2. In a comparative study using the C-terminal SH2 domain of 
p85α and a standard peptide ligand, different modified Pmp analogues 2-5 
(Scheme 1) were investigated [110]. It turned out that only with F2Pmp 5 
[4-(phosphonodifluoromethyl)phenylalanine] could affinities comparable to that of 
the pTyr peptides be achieved, while FPmp 4 [4-(phosphonofluoromethyl)phenylalanine], 
Pmp 2, and HPmp 3 [4-(phosphonohydroxymethyl)phenylalanine] 
produced peptides with decreased affinities. The determined binding 
potencies increased in the order: HPmp 3 < Pmp 2 < FPmp 4 < F2Pmp 5 ≈ pTyr 1 
[110]. 

Of these analogues, mainly F2Pmp 5 was repeatedly used for designing 
peptidic antagonists for Src [54, 110-112], Grb2 [110], p85α [113-115], and 
Abl [112] SH2 domains. F2Pmp 5 is generally considered an acceptable 
substitute for pTyr 1, maintaining remarkable affinity while imparting sufficient 
stability towards intracellular phosphatases. F2Pmp-containing peptidomimetic 
p85α SH2 domain antagonists were shown to have specific cellular effects upon 
microinjection, but proved to be cytotoxic in whole-cell assays [115]. In a 
related study, a fully protected diester of an F2Pmp-containing dipeptide 
derivative 6 was used in a prodrug concept to block the Src and Abl SH2 domains 
[112]. 




The compound was readily taken up in Balb/c 3T3 cells and converted into the 
deprotected parent compound with acceptable reconversion rates. 

In an alternative effort to improve the phosphonate ionization capability and 
hydrogen bonding capacity, the Pmp analogue F4Pmp 7 [2,3,5,6-tetrafluoro-4- 
(phosphonomethyl)phenylalanine] (Scheme 1) was designed and synthesized 
with a tetrafluorophenyl ring [116]. Unexpectedly, the ionization properties 
of F4Pmp 7 were closer to those of Pmp 2 (pKa2 of F4Pmp 7: 6.9; pKa2 of 
Pmp 2: 7.1) than to those of pTyr 1 (pKa2: 5.7). Consequently, F4Pmp 7 incorporated 
into a standard peptide ligand for the Grb2 SH2 domain yielded even poorer 
affinity than the non-fluorinated analogue Pmp 2, suggesting that 
the tetrafluorinated aromatic ring might sterically interfere with the 
pTyr-binding pocket, thus contributing to the reduced affinity [116]. 
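
To make the pKa2 argument concrete, the fraction of each mimic present as the doubly ionized species can be estimated from the Henderson-Hasselbalch relation. The sketch below is an illustration under stated assumptions: it uses the pKa2 values quoted above, assumes a pH of 7.4 (not given in the text), treats the first ionization as complete, and ignores any pKa shift inside the binding pocket. The function name dianion_fraction is introduced here for illustration.

```python
def dianion_fraction(pka2: float, ph: float = 7.4) -> float:
    """Fraction of the doubly ionized form from the Henderson-Hasselbalch relation.

    Only the second ionization is considered; the first is assumed complete.
    """
    return 1.0 / (1.0 + 10.0 ** (pka2 - ph))


# pKa2 values as quoted in the text; pH 7.4 is an assumed physiological value.
PKA2 = {"pTyr 1": 5.7, "Pmp 2": 7.1, "F4Pmp 7": 6.9}

for name, pka2 in PKA2.items():
    print(f"{name}: {dianion_fraction(pka2):.0%} dianion at pH 7.4")
```

Under these assumptions pTyr 1 is about 98% dianionic, whereas Pmp 2 and F4Pmp 7 reach only about 67% and 76%, which is in line with the small gain attributed to ring fluorination.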

Deletion of the phosphate ester oxygen of 1 results in 4-phosphonophenylalanine, 
Phe(PO3H2) 8 (Scheme 1), which was incorporated in Abl and Src 
SH2 domain-targeted peptidomimetic compounds [112] and displayed affinities 
comparable to those of F2Pmp- or pTyr-containing analogues. 

Within the phosphonomethyl series of pTyr analogues a pyridone-based 
mimic 9 (Scheme 1) was developed [117]. The design rationale is based on the 
assumption that the pyridone core structure carrying a methylene phosphonate 
moiety as phosphate mimic would result in a pKa profile close to that of pTyr 1. 
The tripeptide derivative 10 was shown to inhibit CD19 phosphopeptide binding 
to the p85α N-terminal SH2 domain with an IC50 of only 50 µmol/l, while the 
pTyr-containing parent peptide is reported to inhibit this interaction with an 
IC50 of 77 nmol/l. 




The dramatic loss in affinity upon replacing pTyr 1 by the pyridone analogue 
9 is attributed to an only moderate effect of the heterocycle on the pKa 
profile of the attached methylene phosphonate, together with the disruption of 
π-stacking interactions normally seen between the pTyr aromatic ring and 
charged Arg sidechains lining the binding pocket [117]. 
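
The size of this loss can be checked with simple arithmetic. The following sketch takes the two IC50 values as 50 µmol/l for the pyridone-containing tripeptide 10 and 77 nmol/l for the pTyr parent peptide; the helper names to_molar and fold_loss are hypothetical and introduced only for this illustration.

```python
# Conversion factors to mol/l for the concentration units used in the text.
UNIT_TO_MOLAR = {"mmol/l": 1e-3, "umol/l": 1e-6, "nmol/l": 1e-9}


def to_molar(value: float, unit: str) -> float:
    """Convert a concentration to mol/l."""
    return value * UNIT_TO_MOLAR[unit]


def fold_loss(ic50_analogue: tuple, ic50_parent: tuple) -> float:
    """Ratio of analogue to parent IC50; values above 1 mean the analogue is weaker."""
    return to_molar(*ic50_analogue) / to_molar(*ic50_parent)


ratio = fold_loss((50, "umol/l"), (77, "nmol/l"))
print(f"{ratio:.0f}-fold loss")
```

The ratio comes out at roughly 650, i.e. close to three orders of magnitude, provided both values were measured in the same assay.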



3.1.2 

Monocarboxylic Acids 

Non-phosphorus monocarboxylic acid-based pTyr mimics were developed in 
order to produce metabolically stable analogues that do not share the obvious 
disadvantage of two negative charges with the parent pTyr 1. The simplest 
analogue cmF 11 [4-(carboxymethyl)phenylalanine] (Scheme 2) incorporated 
in a Src SH2 domain-targeted peptide was shown to have only poor affinity [54]. 
Considerably higher affinity was retained in a recent study aimed at finding leads 
against the Lck SH2 domain [118, 119], although when compared to the 
pTyr-containing parent peptides, the cmF-derived analogues were generally more 
than two orders of magnitude less active [118, 119]. Despite the low affinity of 
cmF peptides, researchers at Boehringer Ingelheim Pharmaceuticals succeeded 
in deriving a high-resolution structure of Ac-cmF-Glu-Glu-Ile-OH 12 docked 
into the binding canyon of the Lck SH2 domain (Fig. 8) [119]. 

Scheme 2. Monocarboxylic acid-based pTyr mimics 

14: X = CH2-CO2H; 15: X = CF2-CO2H; 18: X = O-CH2-CO2H 

Scheme 3. Grb2-targeted β-turn peptide template used for evaluation of pTyr mimics 11, 13, 
and 17 

Similar results were obtained for Grb2 SH2 domain-directed peptidomimetics 
containing cmF 11 as pTyr mimic [120, 121]. In order to enhance the 
pTyr-mimicking potential of cmF 11, a similar strategy as for the Pmp 2 → F2Pmp 5 
transition was chosen to generate F2cmF 13 (Scheme 2). A Grb2 SH2 
domain-targeted, β-turn-stabilizing peptidomimetic developed at Novartis (see below) 
[68, 69, 122-124] has been used to assess the binding properties of cmF- and 
F2cmF-containing ligands (Scheme 3). 

Contrary to initial expectations, introduction of fluorine into the bridging 
methylene group reduces affinity, as shown by 14 and 15 with IC50 values of 
0.6 µmol/l and 2.0 µmol/l, respectively (Scheme 3) [120, 121]. 

In the context of Lck SH2 domain ligands it was demonstrated that the 
introduction of an additional hydroxyl group onto the bridging methylene group, 
yielding chmF 16 [4-(carboxyhydroxymethyl)phenylalanine] (Scheme 2), 
improved the potency threefold when compared to the corresponding cmF 
analogue [119]. 

A constitutional isomer of chmF 16 is cmT 17 [O-(carboxymethyl)tyrosine] 
(Scheme 2), which showed dramatically reduced affinity when incorporated in 
the Grb2-selective β-turn peptidomimetic 18, causing an almost 30-fold drop in 
affinity when compared to the cmF-derived analogue 14 (Scheme 3). Apparently, 
the carboxylate group is displaced from the phenyl ring in 18 to such an extent 
that the corresponding registry with the native pTyr sidechain is disrupted. The 
application of cmT 17 was extended from SH2 domain antagonists to 
protein-tyrosine phosphatase inhibitors [125], thus demonstrating the versatility of the 
pTyr pharmacophore for signal transduction therapy. 

Apart from the monocarboxylic acid-based pTyr mimics, a sulfonic acid 
derivative 19 (Scheme 2) has been applied in Lck SH2 domain-targeting 
peptidomimetics, but displayed only moderate binding [119]. 

A non-classical, but still rational and structure-based, screening approach 
was applied by researchers at Abbott Laboratories, utilizing the SAR-by-NMR 
technique [126, 127] to identify pTyr mimics for the Lck SH2 domain by 
biophysical means [128]. The major advantage of this particular NMR-based detection 
principle is its sensitivity, since low-molecular weight compounds can be 
identified that bind in the micromolar to millimolar range. This allows one to screen 
for individual pTyr mimics prior to any synthetic efforts towards incorporation 
of these building blocks into any peptide or peptidomimetic scaffold. The 
binding affinity of Ac-pTyr-OEt 20 for the Lck SH2 domain was shown to be 
0.3 mmol/l. Screening of over 3500 compounds revealed a number of molecules 
binding to the pTyr subsite on the SH2 domain. 

Most interestingly, phthalamate analogues 21-23 were identified as poten- 
tially new pTyr mimics that may yield inhibitors equipotent to pTyr-containing 
compounds with the advantage of improved pharmacokinetic profiles. 




3.1.3 

Vicinal Dicarbonyls 

The design of α-carbonyl-aldehydes, -ketones, or -carboxylic acids is based on 
the rationale that an increase in the “spatial concentration” of oxygen atoms 
resembles more closely the hydrogen bond acceptor properties of a phosphate 
group. Researchers at Glaxo described α-dicarbonyl moieties attached to the 
4-position of the phenyl ring of Phe as non-charged phosphate mimics 24, 25 that 
turned out to be two to three orders of magnitude less active when compared to 
a corresponding pTyr analogue in the Src SH2 domain binding assay [129]. 

Oxamic acid 26 and N-hydroxyoxamic acid 27 derivatives were disclosed by 
the Boehringer Ingelheim group as moderately active Lck SH2 domain 
antagonists [119, 130]. 

3.1.4 

Dicarboxylic Acids 

A further step towards increasing the spatial density of oxygen functionalities 
was made by introduction of various malonyl derivatives such as OMT 28 
[O-(2-malonyl)tyrosine] or FOMT 29 [O-(2-(2-fluoromalonyl))tyrosine] (Scheme 4) 
[131-133]. 

Linear hexapeptide ligands of the C-terminal p85α SH2 domain containing 
either malonate derivative displayed 100-fold decreased binding when compared 
to the pTyr peptide analogue [133]. Deletion of the phenolic oxygen atom in 
OMT 28 yields a new pTyr analogue, namely Pmf 30 [p-(2-malonyl)phenylalanine] 
(Scheme 4) [134]. Again, a β-turn peptidomimetic inhibitor of the Grb2 
SH2 domain developed by Novartis [135] was used to test the new Pmf building 
block. When changing from the OMT-containing peptidomimetic to the 
Pmf-containing derivative, a 15- to 20-fold increase in potency was observed. Finally, 
an N- and C-terminally capped tripeptide mimic was obtained with an inhibitory 
potential of IC50 = 8 nmol/l that showed an effective blockade of endogenous 
Grb2 binding to phosphorylated erbB-2 growth factor receptor and inhibition of 
downstream MAP kinase activation in a whole cell system [134]. 

A structurally rather dissimilar pTyr mimic was developed by the Boehringer 
Ingelheim group, namely a triazole-dicarboxylic acid derivative 31 [130], which 
was shown to yield highly active Lck SH2 domain antagonists. 

3.1.5 

Miscellaneous Derivatives 

Researchers at ARIAD Pharmaceuticals described the incorporation of 
3,5-dinitrotyrosine 32 into peptide inhibitors with only double-digit micromolar 
affinity for the SH2 domains of Syk and Src [136]. 
Because the phosphate was replaced by an entirely non-charged 
and phosphatase-stable analogue, the pTyr mimic is considered to have more 
favorable pharmacokinetic properties. However, aromatic nitro compounds 
bear an intrinsic potential of developing toxicological problems [136]. 

Scheme 4. Dicarboxylic acid-type pTyr mimics 

For designing covalent binding ligands, Glaxo employed the finding that 
within several X-ray and NMR structures of Src SH2 domain-peptide complexes 
the sidechain thiol group of Cys-188 is located in close proximity to the meta- 
position of the pTyr aromatic ring [137]. Since the proximity of that Cys residue 
to the ligand is unique for the Src SH2 domain, a thiol-capture strategy was 
chosen as a means of achieving selectivity for Src over other SH2 domains. Based 
upon the well-precedented inhibition principles of cysteine proteases [138], the 
aldehyde group was proposed to be a capturing agent by forming a thiohemi- 
acetal. Upon synthesis of the corresponding dipeptide derivative 33 (Scheme 5), 
its complex with the Src SH2 domain was investigated crystallographically [137]. 




Scheme 5. Thiol-capture strategy for covalent target binding 

As anticipated, 33 binds in the desired manner, capturing the thiol by the 
aldehyde group (Scheme 5). Unexpectedly, 33 binds with only a twofold im- 
proved affinity when compared to the parent dipeptide derivative containing 
the unmodified pTyr residue [137]. 

In conclusion, a variety of mutually different pTyr mimetics incorporated in 
peptide-type derivatives have been investigated. However, an overall satisfactory 
bioisosteric replacement has not yet been identified. 



3.2 

Peptidomimetic Scaffolding 

After the discussion of several approaches aimed at substituting the 
metabolically unstable pTyr 1, the following sections will introduce lead finding and 
optimization strategies with the ultimate goal of developing low-molecular weight, 
drug-like compounds. Due to the shortcomings related to solubility, stability, 
and bioavailability that generally arise with peptide-type analogues, most of the 
SH2 domain-targeted medicinal chemistry programs currently pursued apply a 
structure-based methodology towards an iterative design of small molecules 
with significantly reduced peptidic character, rather than optimizing 
pTyr-containing oligopeptide sequences. In the following, the SH2 domain antagonist 
research programs will be presented according to the fragmentation of a peptide 
ligand depicted in Fig. 9. Rather than chronologically discussing the progress 
made by a certain research group for a special target, the compound classes will 
be introduced according to an increasing extent of de-peptidization. 



3.2.1 

Peptoids 

The design of peptide sequence-derived peptoids is generally considered a 
rather small step on the route from a peptide lead towards a non-peptide 
development candidate, since the polyamide backbone is retained without any 
significant conformational restriction [139, 140]. The Novartis group has synthesized 
a biased tetrapeptoid library for the discovery of monodentate ITAM mimics as 
potential inhibitors for the ZAP70 tandem SH2 domain [141]. The resulting 
tetrapeptoids were expected to show considerable affinity in vivo, due to the 
improved absorption characteristics of the peptoid skeleton [110]. From a 
27-compound library with a constant 4-(phosphonomethyl)benzyl sidechain (Pmp 2 
analogue), only a single ZAP70 antagonist 34 was identified that exhibited 
moderate binding (IC50: 25 µmol/l). 




For comparison, the ITAM-derived 19-mer bidentate peptide inhibits with an 
IC50 of 30 nmol/l [141]. In conclusion, the chosen peptoid approach did not yield 
the desired breakthrough for the design of useful SH2 domain inhibitors. Nor 
could any information for the subsequent design of next-generation analogues 
be derived in terms of a stepwise reduction of the peptide character [141]. 



3.2.2 

Non-Natural Amino Acids 



The first step in a strict hierarchical procedure for transforming peptide lead 
sequences into promising non-peptide analogues generally utilizes the 
incorporation of non-natural amino acids [142]. Based on the submicromolar Grb2 SH2 
domain-binding tripeptide derivative 35 (Scheme 6), the Novartis group 
systematically explored the pTyr+1 position following a conformational rationale 
[123, 143]. 

Scheme 6. Effects of α,α-disubstituted amino acids on Grb2 binding affinity 
(IC50 values given in the scheme: 140 nM, 124 nM, 23 nM) 

As shown in Fig. 10, the Grb2 SH2 domain is unique in that the peptide ligand 
is bound in a type I β-turn conformation, reinforced by an intramolecular 
hydrogen bond. 

To strengthen the turn-inducing potential of next-generation Grb2 SH2 
domain ligands, α,α-dialkylated residues were probed in the i + 1 position of the 
reverse turn, since they are well known to adopt kinked conformations as found 
in, e.g., 3₁₀ helices [144-146]. From the X-ray structure of the Grb2 SH2 domain 
in complex with a peptide ligand (Fig. 10) [68] it was further shown that a 
hydrophobic moiety in the pTyr+1 position is accommodated by a complementary 
binding pocket. Consequently, a series of hydrophobic α,α-dialkylated amino acids 
was introduced, resulting in a clear structure-activity relationship ranging 
from the least active Aib 36 to the most active Ac6c 40 (Scheme 6). The 
1-aminocyclohexanecarboxylic acid Ac6c 40 produced a compound with a 65-fold 
increased affinity when compared to the Ile analogue 35. The design rationales 
and underlying structural hypothesis that finally led to the proposed 
non-natural amino acids 36-40 (Scheme 6) have been fully confirmed retrospectively 
by X-ray crystallography [147]. The complex of a peptide derivative containing 
40 in the pTyr+1 position reveals two functions for Ac6c 40, namely the 
stabilization of the βI turn conformation and the hydrophobic interaction with a 
binding pocket adjacent to the pTyr pocket [147]. These findings allowed 
the Novartis group to proceed further towards high-affinity Grb2 SH2 domain 
antagonists that were derived from that intermediate analogue and displayed 
even in vivo activity (see below). 

Fig. 10. Experimentally determined complexes of the Grb2 SH2 domain with a 
pTyr-containing oligopeptide (left, 1BMB.pdb [71]) and inhibitor 52 (right, 1CJ1.pdb [147]). Even 
though 52 lacks the pTyr+3 residue for forming the β-turn conformation, the residues pTyr, 
Ac6c and Achec bind in the proposed mode to Grb2 



3.2.3 

Capping Groups as Amino Acid Analogues and Dipeptide Mimetics 

A convenient strategy to shorten peptide lead sequences is the utilization of 
amino acid sidechain-derived capping groups. By attachment of simple amines 
onto the C-terminus, and carboxylic acids onto the N-terminus, the charged ter- 
minal functionalities can be omitted. Additionally, introduction of tailor-made 
substituents into capping groups allows one to explore further binding pockets 
on the target protein surface, thus allowing one to increase affinity and specifi- 
city of resulting analogues. 

After successful replacement of a natural amino acid by Ac6c 40 
(Scheme 6) in the Grb2 SH2 domain-targeted lead, the Novartis group decided 
to probe further the position following Asn by attaching different amines of the 
indolylpropylamine type onto the Asn C-terminus as shown in Scheme 7 [135]. 

In the context of Pmp-containing tripeptide derivatives, the 3-indol-1-yl-propyl 
analogue turned out to exhibit high affinity. Substituting the indole 5-position 
with methyl or hydroxyl resulted in compounds 41 and 42 with subnanomolar 
IC50 values (Scheme 7). These were the first SH2 domain antagonists 
reported with subnanomolar affinity. These systematic 
studies enabled Novartis to push further analogues into in vivo studies [148, 
149]. The hydroxynaphthyl analogue CGP78850 43 was shown to block the 
epidermal growth factor receptor (EGFR)-Grb2 interaction in living cells. 

This analogue also inhibits the growth of cells transformed by receptor 
tyrosine kinases which transmit a proliferative signal through Grb2 to Ras, but not 
that of cells transformed by oncogenic Raf or cells containing activating Ras 
mutations [148], thus clearly supporting the Grb2 SH2 domain as the molecular target 
of CGP78850 43. To improve the penetration of 43 across cellular 
membranes, the phosphonamide prodrug CGP85793 44 was synthesized. Upon 
hydrolysis by intracellular esterases, the active component CGP78850 43 is 
liberated from its precursor 44. The studies on 43 were extended mechanistically 
to interference with cell motility, which generally correlates with oncogenic 
invasiveness and metastatic potential [150, 151]. It was demonstrated that 
inhibition of the Grb2 SH2 domain by 43 prevents HGF (hepatocyte growth 
factor)/SF (scatter factor)-induced A431 and MDCK (Madin-Darby canine kidney) 
cell motility and blocks the associated cytoskeletal rearrangement. CGP78850 
43 is considered a selective Grb2 SH2 domain antagonist that most likely 
opens an avenue for new cancer therapeutics, since it effectively controls the 
ability of tumor cells to move and traverse interstitial barriers. 

While Novartis designed C-terminally blocked tripeptide analogues 41-44, 
the Boehringer Ingelheim group succeeded in replacing a C-terminal dipeptide 
by a capping amine in their Lck SH2 domain antagonist program [152]. Based 
on the generic recognition motif -pTyr-Glu-Glu-Ile- the peptide portion 
downstream of pTyr was systematically replaced, yielding a series of dipeptide 
derivatives possessing remarkable affinities. Lck SH2 domain ligands were developed 
that lack any carboxylic acid group. Interestingly, in the most active analogue 
45 the Glu in the pTyr+1 position was changed to Leu, and the C-terminal -Glu-Ile- 
fragment could be replaced by an (S)-1-(4-isopropylphenyl)ethylamine. 



With a Ki of 0.2 µmol/l, 45 turned out to be almost equipotent to a standard 
peptide analogue. The main advantage of 45 is that the three 
carboxylic acid groups of the -Glu-Glu-Ile-OH fragment are eliminated in the 
leucine amide. Since the affinity is still far from being acceptable, further 
optimization is ongoing in the Boehringer Ingelheim group [152]. 

A similar terminal-peptide-replacement strategy was used in the Src SH2 
domain antagonist program pursued at the Parke-Davis Pharmaceutical 
Research Division [153] over several years. Systematic replacement of pTyr and 
downstream portions of the recognition motif -pTyr-Glu-Glu-Ile- led to a series 
of peptidomimetic analogues. For example, the Ac-pTyr backbone part was 
replaced by a substituted succinic acid, yielding compound 46 with retained 
affinity (IC50: 0.4 µmol/l) when compared to a standard lead peptide (Scheme 8). 
Replacement of the C-terminal dipeptide unit -Glu-Ile- by a 
D-homocyclohexylalanine amide (Hcy) resulted, in the Ac-pTyr series, in compounds of only 
moderately decreased affinity (47: IC50 = 1.8 µmol/l), while N-methylation of the 
Hcy-preceding amide bond together with the deletion of the terminal 
carboxamide brought back the original affinity (48: IC50 = 0.8 µmol/l) (Scheme 8). 

Scheme 8. Terminal-peptide-replacement strategy pursued by Parke-Davis 

With compound 48 the original pentapeptide sequence pTyr-Glu-Glu-Ile-Glu 
of the standard peptide was reduced to a dipeptide derivative with comparable 
binding characteristics. In an attempt to remove the chiral center from the pTyr 
building block, a urea analogue was synthesized based on an N-substituted 
glycine derivative, which caused a tenfold drop in affinity (49: IC50 = 7 µmol/l). The 
crystal structure of the urea derivative 49 in its complex with the Src SH2 domain 
revealed an altered binding mode of the ligand with respect to the backbone 
conformation (Fig. 11). Most strikingly, the C-terminal amide bond adopts a 
cis-configuration, thereby displacing a highly conserved protein-bound water molecule. 

Due to the changed backbone conformation, several key interactions nor- 
mally found in SH2 domain-peptide complexes are not formed, thus demanding 
further optimization in order to meet the spatial requirements of the target 
more precisely [153]. 
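
The affinity differences within this series can be put on an energy scale. The sketch below converts ratios of the IC50 values quoted above for 46-49 into apparent free energy differences relative to 48. It rests on assumptions not stated in the text: that IC50 values from the same assay scale with the dissociation constant, and that the temperature is 298.15 K. Only ratios enter, so the concentration unit cancels. The function name ddg_kj_per_mol is introduced here for illustration.

```python
import math

R_KJ = 8.314462618e-3   # gas constant in kJ/(mol K)
T_K = 298.15            # assumed temperature (25 degrees C); not stated in the text


def ddg_kj_per_mol(ic50: float, ic50_ref: float) -> float:
    """Apparent free energy penalty relative to a reference ligand.

    Assumes IC50 values from the same assay scale with Kd, so that
    ddG = RT ln(IC50 / IC50_ref). Positive values mean weaker binding.
    """
    return R_KJ * T_K * math.log(ic50 / ic50_ref)


# IC50 values as quoted in the text for compounds 46-49 (same unit throughout).
IC50 = {"46": 0.4, "47": 1.8, "48": 0.8, "49": 7.0}

for compound, value in IC50.items():
    print(f"{compound}: {ddg_kj_per_mol(value, IC50['48']):+.1f} kJ/mol vs 48")
```

On this scale the roughly tenfold drop from 48 to 49 corresponds to a penalty of about 5 kJ/mol, which is of the order of a single lost hydrogen bond and so is at least consistent with the missing key interactions described above.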

Fig. 11. Experimentally determined 3D structure of the Src SH2 domain in complex with 
compound 48 (1SKJ.pdb [153]). The cyclohexyl moiety binds into the pTyr+3 binding pocket 
(right) 



Even though the Novartis group achieved promising in vitro as well as in vivo 
results with compounds of type 43 against the Grb2 SH2 domain, they made 
further attempts to reduce the peptide nature. Starting from the N-terminally 
3-amino-Z-protected tripeptide amide 50 (IC50: 1 nmol/l, Scheme 9), efforts have 
been made to replace the Asn in the pTyr+2 position [147]. A computer-aided 
substructure search in the Novartis in-house database uncovered 
cis-2-aminocyclohex-3-enecarboxylic acid 51 (Achec, Scheme 9) as a surrogate for the 
Asn residue in 50. 




Scheme 9. Asn-replacement strategy utilizing the β-amino acid 51

Upon synthesis of the dipeptide derivative 52, the (1S,2R)-isomer showed a
comparable inhibitory potential with an IC50 of 1.6 nmol/l. Interestingly, the
dipeptide mimic with the enantiomeric β-amino acid (1R,2S)-Achec showed no
binding at all. This study can be taken as a textbook example of a successful
structure-based design approach utilizing a combination of molecular modeling,
virtual screening of databases, medicinal chemistry, and experimental
structure determination [147]. The binding mode of 52 to the Grb2 SH2 domain
was shown by X-ray crystallography to be in perfect agreement with the
predictions from molecular modeling (see Fig. 10) [147]. With 52, an analogue has
been developed that reaches nanomolar binding without addressing the binding
pocket of the pTyr+3 residue (see, for example, 41-44). While 52 is still a
tripeptide analogue, alternative building blocks were evaluated that allowed for a
simultaneous replacement of the C-terminal -Xaa-Asn- dipeptide fragment. By
interactive design based on crystal structures of Grb2 SH2 domain-ligand
complexes, the Novartis group replaced the terminal dipeptide with a
1,3-disubstituted phenyl ring (53 in Scheme 9) with the pTyr+1 sidechain attached
to the aminomethyl substituent and a urea as Asn sidechain surrogate [154].
Unfortunately, the binding affinity of 53 was only in the micromolar range.
However, this result shows that it is possible to replace a dipeptide fragment
with a single optimized non-peptide building block, yielding an SH2 domain ligand
(53) that can be classified as a single amino acid derivative.

3.2.4 

C-Terminal Tripeptide Mimetics 

Apart from Grb2, the Lck SH2 domain also served as a target for peptidomimetic
compounds that were designed along the line of replacing the entire C-terminal
peptide sequence of the generic recognition motif -pTyr-Glu-Glu-Ile-
by non-peptide structures [155, 156]. Applying a combinatorial chemistry
approach, two libraries were designed and synthesized. An 84-compound library
was used to identify the optimal -Glu-Glu- replacement, while a 900-compound
array revealed a suitable Ile mimic. The pTyr derivative 54 was identified as one
of the most active molecules of that series with an IC50 of 1.4 µmol/l [155, 156].

As in 53 (Scheme 9), the residual peptide sequence C-terminal to pTyr was
exchanged for a non-peptide moiety. However, the low affinity of 54 can
probably be attributed to an increased flexibility. Comparing 54 with the
Grb2-targeted peptidomimetics 41, 42, or 43, it is reasonable to assume that the
module C-terminally attached to pTyr in 54 serves as a tripeptide mimetic.

The Boehringer Ingelheim group chose a pyridone-based core structure to
replace the -Glu-Glu-Ile- tripeptide of the generic SH2 domain recognition
sequence [130]. From a series of more than 200 analogues, pyridones with
different decoration patterns were identified that gave pTyr derivatives
with submicromolar affinities (e.g., 55, 56) for the Lck SH2 domain [130].

By focusing the entire compound collection on the pTyr-pyridone scaffold,
the researchers made use of a “privileged” structural element, since the pyridone
was repeatedly utilized to induce β-strand conformations in serine and cysteine
protease inhibitors [157, 158]. From this point of view, the potential of a
modular chemistry concept relying on versatile peptidomimetic building blocks for
rapid and efficient lead finding becomes apparent.

Researchers at ARIAD Pharmaceuticals, interested over several years in
developing Src SH2 domain inhibitors into therapeutic drugs, followed a strategy
that would allow one to replace the C-terminal tripeptide of the generic
recognition motif by a non-peptide module [159]. The design explicitly accounted
for a hydrophobic protein surface patch separating the two “holes” of the
“socket” that accommodate the “two-pronged plug-type” ligand. Consequently,
a scaffold was required that accounted for that hydrophobic area, while
appropriately delivering the two relevant ligand sidechains (pTyr and pTyr+3)
into the corresponding pockets [159]. The initial design attempts resulted in a
1,3-disubstituted phenyl ring core that was decorated with the
interaction-mediating groups as depicted in Scheme 10 (57: IC50 = 769 µmol/l;
58: IC50 = 306 µmol/l (diastereomeric mixture)) [159].

Scheme 10. Scaffolding the interaction-mediating sidechains on cyclic hydrophobic
core templates



For synthetic reasons, the cyclic core was changed in favor of a heterocyclic
scaffold, yielding the thiazole-based mimetics 59 (IC50: 483 µmol/l) and 60
(IC50: 26 µmol/l) [159]. Even though the binding affinities could not be improved
with respect to the control peptide (IC50: 6 µmol/l), a new scaffold replacing the
C-terminal tripeptide was successfully designed (Scheme 10) and proven by
X-ray crystallography to bind in the predicted manner [159]. These results served
as a starting point for a series of follow-up studies at ARIAD Pharmaceuticals.
Apart from the 2,4-disubstituted thiazoles as in 59 or 60 (Scheme 10), the
study was extended to different heterocyclic core structures, such as
2,5-disubstituted thiazoles 61 (IC50: 12 µmol/l), or 3,5-disubstituted
1,2,4-oxadiazoles [160]. A systematic comparison of different heterocyclic
scaffolds suggested that the heterocycles were interchangeable without
significant alterations in affinity, as long as a comparable decoration pattern
was retained [160].





Since the oxadiazole series allowed an acceptable synthetic flexibility in terms
of analogue preparation, this series was explored systematically [160].
Variations of the C-terminal lipophilic groups yielded analogues with affinities
comparable to the standard peptide (IC50: 6 µmol/l). Exchange of the Glu sidechain
(e.g., 60: IC50 = 26 µmol/l; 62: IC50 = 7 µmol/l) for the Abu sidechain (61:
IC50 = 12 µmol/l) or the Trp sidechain (63: IC50 = 16 µmol/l) resulted in a
twofold loss in affinity, but allowed one to reduce the overall charge of the
peptidomimetics, rendering these analogues promising leads for further
optimization [160].
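
The affinity comparisons in this paragraph can be made explicit with a short calculation. The following Python sketch is only an illustration: it takes the IC50 values quoted in the text for compounds 59-63, assumes that all of them are given in µmol/l and were measured under comparable conditions, and expresses each as a multiple of the control peptide value (6 µmol/l). The names fold_loss, CONTROL_PEPTIDE_IC50 and IC50_BY_COMPOUND are choices made for this example and do not come from the cited work [160].

```python
# IC50 values as quoted in the text (assumed unit: micromol/l).
CONTROL_PEPTIDE_IC50 = 6.0

IC50_BY_COMPOUND = {
    "59": 483.0,
    "60": 26.0,
    "61": 12.0,
    "62": 7.0,
    "63": 16.0,
}


def fold_loss(ic50: float, reference: float = CONTROL_PEPTIDE_IC50) -> float:
    """Return the ratio IC50 / reference, i.e. how many times weaker
    a compound binds than the reference ligand."""
    if ic50 <= 0 or reference <= 0:
        raise ValueError("IC50 values must be positive")
    return ic50 / reference


if __name__ == "__main__":
    # List the compounds from strongest to weakest binder.
    for compound, ic50 in sorted(IC50_BY_COMPOUND.items(), key=lambda kv: kv[1]):
        print(f"{compound}: IC50 = {ic50:g}, {fold_loss(ic50):.1f}-fold vs. control")
```

With these numbers, compound 61 comes out at twice the control value, which matches the twofold loss in affinity mentioned above.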

Exactly along this line of modification, ARIAD Pharmaceuticals succeeded in
generating an Src SH2 domain antagonist containing a bicyclic core element
serving as tripeptide mimic, notably AP22161 (64) [161].





Reminiscent of the covalent inhibition principle employed by Glaxo with 33
(Scheme 5), AP22161 (64) covalently binds to the target SH2 domain. As well as
exhibiting a reasonable in vitro inhibition with an IC50 of 0.24 µmol/l against
the Src SH2 domain, 64 was also shown in vivo to diminish the osteoclast
resorption activity in a mechanism-based cellular assay [161]. The concept of
heterocyclic moieties bridging the distance between the two binding sidechains
was extrapolated by ARIAD Pharmaceuticals from the Src SH2 domain to the next
representative of the target family, i.e., the ZAP70 SH2 domain [162]. Again the
lead finding efficiency was significantly increased by developing privileged
peptidomimetic building blocks that account for the target family-wide molecular
recognition characteristics, followed by a repeated use of these modules in
different medicinal chemistry programs. Variation of the C-terminal lipophilic
group attached to C-3 of the oxadiazole core (e.g., 62, 63) allowed the
modulation of the SH2 domain specificity [162]. In a systematic iterative
optimization procedure, detailed structure-activity relationships were obtained
that finally resulted in a series of ZAP70-selective peptidomimetics, such as 65
(IC50: 8 µmol/l) and 66 (IC50: 1 µmol/l) [162].



In all cases discussed here, a tripeptide fragment was exchanged for a
non-peptide module, resulting in promising peptidomimetics.

3.2.5 

Non-Peptide Peptidomimetics 

The ultimate goal of any peptidomimetic approach is a non-peptide compound
that more closely resembles the attributes of commonly used drug molecules,
i.e., the peptidic origin is not apparent at first glance. Currently, only a few
examples of SH2 domain-targeted non-peptide peptidomimetics have been reported,
reflecting the novelty of this particular research area. Among the main
protagonists, Parke-Davis [163] and Novartis [164] published low-molecular weight
analogues that can be classified into the group of non-peptide peptidomimetics.

Parke-Davis targeted a template structure with no formal charge replacing
the Glu-Glu dipeptide bridging pTyr with Ile (pTyr+3), since the X-ray structure
of SH2-peptide complexes suggested that the contribution of the two charged
carboxylates was nonessential for binding due to their solvent-exposed
orientation [64]. The pTyr sidechain bound to its binding pocket was used as
anchoring point for a protein-based de novo design of non-peptide ligands. The
design strategy aimed to incorporate a fairly rigid group as bridging moiety with
defined hydrogen bonding capabilities. After definition of these structurally
derived geometric restraints, a search of the Cambridge Crystallographic
Database [165] revealed a benzoxazinone bicyclic core 67 as a high-ranking
candidate (Scheme 11) [163].

Scheme 11. Development of non-peptide Src SH2 domain antagonists emerging from a
computer-aided molecular design strategy

The ring-encoded cis-amide would further displace the aforementioned
protein-bound water molecule, thus entropically contributing to binding affinity.
Structural fine-tuning and an assessment of synthetic feasibility prompted the
research group at Parke-Davis to pursue a ring-opened form of the initial
“virtual hit”. Compound 68 was the first to be synthesized from that molecular
modeling study and exhibited a remarkable affinity of 9.7 µmol/l (Scheme 11).
Further optimization yielded a series of compounds with single-digit micromolar
affinities for the Src SH2 domain, e.g., 69 (IC50: 6.5 µmol/l), none of
which reveals an obvious peptidic origin (Scheme 11) [163]. X-ray studies of
these ligands in complex with the target SH2 domain confirmed the predictions
made by molecular modeling. With these de novo designed compounds
the first cycle in the iterative structure-based design application was completed,
and the information is being used to guide next-generation analogue design toward
improved levels of potency [163].

The Novartis group used the X-ray structure of a Grb2-peptide complex [68]
as the structural basis for a design attempt that yielded entirely new
non-peptide SH2 domain ligands [164]. As mentioned several times throughout this
contribution, the interactions of the pTyr sidechain and the Asn sidechain in
the pTyr+2 position of the peptide ligand have been identified as key elements
for molecular recognition (see Fig. 10). The obvious relevance of these two
sidechain functionalities allowed the definition of a minimal pharmacophore
pattern that served as structural basis for seeking non-peptide molecules
containing these pharmacophoric elements. Based on thorough molecular modeling
studies, the Asn sidechain was replaced by a 4-aminopyrimidine moiety, the latter
being capable of retaining the hydrogen bond network formed in the complex
structure (Scheme 12).

Scheme 12. Design of aminopyrimidines as Asn sidechain mimetics

Further modeling studies led to the de novo design of the entirely non-peptidic,
“drug-like” compounds 70 and 71, which were synthesized and biologically
characterized [164]. It turned out that the open-chain compound 70 was inactive
at concentrations up to 200 µmol/l, while the conformationally constrained
thiazole analogue 71 displayed a binding affinity (IC50: 25.9 µmol/l) within
the range of the reference tripeptide analogues (Scheme 12) [164].

These examples clearly prove the viability of a structure-based peptidomimetic
design approach for developing non-peptide peptidomimetics for therapeutic
interference with protein-protein interaction events.

4 

Versatility of the Phosphotyrosine Pharmacophore 

The recognition of pTyr-containing protein segments by downstream signaling
adaptor proteins or enzymes is a central theme in intracellular signal
transduction. As outlined in this contribution, selective inhibition of SH2
domains by low-molecular weight compounds has attracted considerable interest in
recent years. A great deal of the synthetic work has been devoted to the
development of useful pTyr mimics (Sect. 3.1). Future medicinal chemistry
programs will undoubtedly benefit from the pioneering studies reviewed in this
contribution, not only for potential SH2 domain antagonists but also for a
relatively new class of pTyr-binding target proteins, i.e., the PI
(phosphotyrosine-interaction) domains, more commonly termed PTB
(phosphotyrosine-binding) domains [26, 166]. Comparable to SH2 domains, PTB
domains recruit signaling proteins to the vicinity of the intracellular domains
of activated receptors. The first PTB domains were identified in two prominent
RTK substrates, i.e., Shc and IRS-1 (insulin receptor substrate)
[26, 27, 167, 168]. The association of PTB-containing proteins with signaling
proteins involved in a variety of human diseases suggests that PTB domains might
serve as novel promising targets for therapy [169]. For example, the
PTB-containing multidomain proteins FE65, X11, and Dab bind to a cytoplasmic
portion of the amyloid precursor protein (APP) [170-175] that is important for
APP processing and internalization. At least FE65 and X11 antagonists are
believed to block amyloid plaque formation [176] through inhibition of APP
processing and may thus serve as useful therapeutics for Alzheimer’s disease.




Fig. 12. 3D structure of a pTyr-containing oligopeptide bound to the IRS-1 (insulin
receptor substrate) PTB domain (1IRS.pdb). The Asn-Pro-Ala-pTyr tetrapeptide
sequence adopts a regular βI turn conformation [181]

Although PTB domains and SH2 domains are both capable of
phosphorylation-dependent binding [177], their ligand “read-out” and binding
modes are substantially different. Specificity in PTB domain interactions is
conferred by sequences N-terminal to pTyr (Fig. 12), rather than C-terminal as
found for SH2 domains. PTB domains were identified based on their ability to
recognize and bind -Asn-Pro-Any-pTyr- sequences [178].
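
The -Asn-Pro-Any-pTyr- recognition pattern lends itself to a simple sequence scan. The Python sketch below is a hedged illustration only: it searches a one-letter protein sequence for N-P-x-Y tetrapeptides, where x is any residue. The one-letter code does not encode phosphorylation, so a hit marks only a candidate site whose tyrosine would still have to be phosphorylated; the function name find_npxy and the test sequence are hypothetical.

```python
import re

# Lookahead so that overlapping candidate motifs are all reported.
NPXY_PATTERN = re.compile(r"(?=(NP.Y))")


def find_npxy(sequence: str) -> list[tuple[int, str]]:
    """Return (1-based start position, tetrapeptide) for every
    Asn-Pro-x-Tyr motif in a one-letter amino acid sequence."""
    seq = sequence.upper()
    return [(m.start() + 1, m.group(1)) for m in NPXY_PATTERN.finditer(seq)]


if __name__ == "__main__":
    # Hypothetical sequence, chosen only to contain two candidate motifs.
    example = "GSLNPAYASSAANPQYTT"
    for position, motif in find_npxy(example):
        print(f"candidate PTB motif {motif} at residue {position}")
```

Because specificity is conferred by residues N-terminal to the tyrosine, a practical scan would report the preceding residues as well; that extension is omitted here for brevity.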

Even though the requirement for a phosphorylated Tyr sidechain seems less
pronounced for PTB domains [179], a modular peptidomimetics-based lead
finding strategy could profit immediately from the knowledge and experience
gathered in SH2 domain-targeted medicinal chemistry projects.

In summary, over the last five years pTyr has emerged as a versatile
pharmacophoric element involved in the basic processes of intracellular signal
transduction. Bioisosteric pTyr mimetics would not only allow one to interfere
in the formation of activated multiprotein signaling complexes by blocking the
adaptor protein recognition sites, but could also provide useful tools for PTP
inhibitors. A certain degree of systematization of biomedical research projects
from a target family point of view is required to permit cross-fertilization in
terms of exchanging privileged building blocks and synthetic strategies, thus
maximizing efficiency in lead finding and optimization.

5 

Conclusion and Perspective 

This contribution was intended to highlight pTyr-containing peptidomimetic
compounds designed to interfere with signal transduction pathways by blocking
protein-protein interactions mediated by SH2 domains. In addition, they have
significant potential as PTB domain antagonists. Up to now,
the majority of efforts aimed at developing specific SH2 domain antagonists
have failed to reach an advanced stage of preclinical or even clinical
development, due to the problems associated with generating nanomolar or
subnanomolar binding entities. Most strikingly, a satisfactory structural
replacement of pTyr has not yet been found. However, the design of
therapeutically promising lead structures for SH2 domains presents novel
challenges for medicinal chemistry. The excellent studies conducted by research
groups at, among others, Novartis, Parke-Davis, ARIAD, and Boehringer Ingelheim
clearly prove the viability of signal transduction therapy, which is undoubtedly
still in its infancy. It remains difficult, if not impossible, to predict the
outcome of a selective knock-out of a distinct signal transduction element, such
as an SH2 or a PTB domain. The applicability of adaptor protein-targeted
inhibitors for therapeutic intervention in human diseases will not be proven
until the discovery and subsequent clinical evaluation of potent and selective
inhibitors of these pathways. The research projects discussed in this review
will contribute to reaching this goal. Apart from the biomedical aspects
addressed in this contribution, the area of SH2 domain antagonists turns out to
be a textbook example of modern medicinal chemistry. According to a recent
definition [180], this is still a chemistry-based discipline, being increasingly
influenced by related areas such as biochemical, medical, pharmaceutical, and
biophysical sciences. Instead of applying the traditional “hit-or-miss” method
of finding new leads, a rational approach was preferred in all studies
discussed, taking into account the available structural and functional
information on the involved molecular components.

Ras is a central switch in regulation of cell proliferation and differentiation. It becomes
activated by extracellular stimuli like growth factors and relays the signal into diverse cellular
pathways. A balanced action of regulatory proteins is required to maintain the desired level of
Ras activity. Oncogenic mutations in Ras lead to uncontrollable over-function, resulting in
transformation of the cell. Therefore Ras became a target in many searches for anti-cancer
agents. From the biochemical point of view Ras serves as the paradigm of the superfamily of
GTP-binding proteins which fulfill cellular functions as diverse as transmission of vision and
regulation of cell morphology. The common theme is switching between the GTP- and GDP-bound
forms which correspond to the active and silent states, respectively. This article describes
the function of Ras on the molecular level, leaving the enormous wealth of biological data
in the Ras field largely aside. Rather, the focus is put on biochemical and biophysical methods
which were used to analyze the interactions with the diverse partner molecules in a
quantitative way to elucidate the mechanism of activation and regulation.

Keywords: Ras, Kinetics, Structure, Effectors, Regulators, GTPase, Fluorescence, Membrane 

1

Introduction 

1.1 

Signal Transduction 

All living organisms have two primary objectives: to maintain their vital
functions and to relay their genetic information by reproduction. To accomplish
these tasks successfully, an organism has to react flexibly to its environment.
It has to escape from enemies, and it has to adapt to changes such as heat or
cold, drought or moisture. If food is available, the metabolism has to be
switched quickly for an optimized synthesis of an energy reserve, and in the case
of an injury damaged tissue has to be regenerated.

Multicellular organisms in particular must restrict cell division to the time
and place where it is necessary to build up new functional structures or enhance
existing ones during growth and differentiation.

Signal transduction ensures that information is selectively addressed
to single cell types, where it is registered and transformed in a common
manner, e.g., in the case of a human being, which is composed of some trillions
of cells [1]. In this way metabolic fluxes or cell division are controlled
efficiently.

Animal cells are separated from each other by lipid membranes. During sig- 
nal transduction this barrier has to be passed, which can be realized by perma- 
nently or temporarily opened channels or by an indirect mechanism without 
material flux between the extra- and intracellular lumen (Fig. 1). 

Fig. 1A-C. Communication between cells can be realized in three principal modes: A direct
exchange of material via gap junctions is used to synchronize, e.g., heart muscle cells; B
ligand- and voltage-gated channels open for a short period of time to allow ion flux into or out
of the target cell in response to ligand binding or a change in membrane potential, respectively;
C indirect transmission of a signal. A transmembrane receptor is in its inactive state when no
messenger is present. If a messenger molecule binds at the outer side of the receptor, a change
in conformation is induced. Further proteins can then bind to the intracellular side of the
receptor and transmit the information

In the latter case an extracellular messenger has to dock at the receptor’s
extracellular binding site on the cell surface. The information about the
occupancy of the corresponding receptor is transmitted through the transmembrane
part of the protein into its cytosolic domains by conformational changes. This
structural response can be induced by an additional dimerization and results in a
covalent modification of intracellular side chains. The new conformation is then
recognized by cytosolic partner molecules. In this connection GTP binding
proteins often work as an essential molecular switch [2]. GTP binding proteins
are classified in five superfamilies:

- Translation factors in protein synthesis 

- Gα subunits of heterotrimeric G-proteins

- Ras-like proteins of small GTPases 

- Large GTP-binding proteins 

- Subunits of signal recognition particles and their receptors 

All GTP binding proteins in signal transduction share a common structural ele- 
ment - the “Ras-like” domain which is responsible for the specific complexation 
of guanosine diphosphate and -triphosphate and which contains catalytic resi- 
dues that promote GTP-hydrolysis. 



1.2 

Heterotrimeric G-Proteins 

G-protein-coupled receptors (GPCR) represent the start element in second
messenger-producing systems. They comprise a family of over 1000 structurally
related members. These membrane proteins are also called serpentine or
seven-helix receptors due to their seven transmembrane domains with an α-helical
conformation. Receptors belonging to this class respond to a variety of hormones
and neurotransmitters, and they detect odorant molecules or light [3, 4].

The ligand-complexed form of a GPCR allows binding of a heterotrimeric
G protein [5], which is anchored at the cytoplasmic side of the plasma membrane
via hydrophobic modifications in two of its three subunits. Binding to the
receptor induces dissociation of the tightly bound GDP from the α subunit (Gα)
of the G protein, which is then replaced by cytosolic GTP. This step leads to
dissociation of Gα from the Gβγ dimer. Gα·GTP can now interact with effector
proteins, which themselves may generate second messenger molecules in the cell.
Intrinsic α subunit GTPase activity catalyzes GTP hydrolysis, causing
inactivation. The GDP-bound α subunit re-associates with the Gβγ subunit to form
an inactive G protein heterotrimer, which can again become activated by the
receptor (Fig. 2).

So far four G protein subfamilies have been identified and classified according
to the more than 20 known α subunits (Gq/11, Gi/o, Gs, and G12/13) [6].
There is a similar variety of β and γ subunits. G proteins can act in either
a stimulatory or an inhibitory manner.



1.3 

Small GTP-Binding Proteins 

The Ras superfamily of GTP-binding proteins is composed of several subfamilies
[7] which all contain the “Ras-like domain” of approximately 160 amino
acids and 5 consensus sequences. Two of these highly conserved motifs are
responsible for specific recognition of the guanosine nucleotide, and three are
necessary for binding of the phosphate groups and complexation of a Mg2+ ion,
which is found in all Ras-like proteins.

Fig. 2. An adrenaline molecule (1) binds to its binding site on the extracellular side of an
adrenaline receptor (2). Thereby, the exchange of GDP by GTP in the Gα subunit of a
heterotrimeric G protein (3) is induced, followed by the dissociation of the Gα and Gβγ
subunits. Gα now binds and stimulates its effector adenylate cyclase (4), which produces cyclic
AMP (5) from ATP (6). This second messenger starts a cascade of enzymatic reactions, which
alter the behavior of the cell via several phosphorylation steps



The Ras and Rho proteins are involved in signal transduction; Ran, Rab and
Arf proteins regulate events of the intracellular transport. The function of Rad
proteins is not yet clear.

All proteins of the Ras superfamily (Table 1), with the exception of the Ran
protein, undergo a posttranslational modification that introduces at least
one hydrophobic group. These lipid anchors qualify the members of the
Ras family as peripheral membrane proteins, which insert into membrane
structures from the cytoplasmic side only.



Table 1. Members of the Ras superfamily

Family   Function

Ras      Cell growth and differentiation
Rho      Organization of the cytoskeleton
Rab      Vesicular transport
Arf      Vesicular transport between endoplasmic reticulum and cis-Golgi apparatus
Ran      Nucleo-cytoplasmic transport
Rad      Not yet clear (see text)





J. Kuhlmann • C. Herrmann 



1.4 

Ras 

Ras is the most prominent member of the small GTP-binding proteins. Homologs
are found in all eukaryotic organisms and it is present in all types of cells.
Four isoforms of Ras are known in humans, namely H-Ras, N-Ras, and K-Ras4A
and -4B. Their sequences are almost identical, with only the 20 C-terminal amino
acids representing a hypervariable region. No clear evidence is available as to
whether they are redundant in biological function or are responsible for
different signal pathways. However, the different effects of knock-outs of these
genes in mice, which are lethal only when K-ras is disrupted [8], suggest a major
importance for K-Ras. Recently, differential activities of the four isoforms with
respect to cell transformation and stimulation of cell motility have been
reported [9] (see [10] for review).

Ras plays a pivotal role in cellular signal transduction, as many different
extracellular inputs are relayed to Ras inside the cell and are transformed into
specific biological responses by Ras-activated signal pathways [11]. Depending on
the cellular context, this leads to cell differentiation or proliferation.
Originally, Ras was discovered as an oncogene causing cancer (rat sarcomas [12]).
Because mutated Ras has the potential to transform cells, ultimately into tumor
cells, Ras is classified as a proto-oncogene. The molecular mechanism and aspects
important for the biological function of Ras, such as its cellular localization
and its regulator and target proteins, are described in the following sections.



1.4.1 

Membrane Anchorage 

Ras is strictly localized to the inner side of the plasma membrane. A lipid anchor 
covalently attached to the C-terminus of Ras penetrates into the lipid bilayer. 
This membrane anchorage is essential for the biological activity of Ras. Hence, 
the inhibition of anchor attachment has become an attractive pharmacological 
target [13]. See: Waldmann H, Thutewohl M, Ras-Farnesyltransferase-inhibitors 
as promising anti-tumor drugs, this volume. 

The primary target for posttranslational modification is the so-called Caax
box acting as a prenylation signal. These four letters represent the four
C-terminal amino acids of almost all Ras-related proteins, indicating a Cys and
two aliphatic amino acids. Where x is a Ser or Met, as in Ras, the protein becomes
farnesylated, and where x represents Leu or Phe a geranylgeranyl group is attached.
Ras becomes modified presumably immediately after synthesis. In the first
posttranslational modification step a farnesyl group is covalently linked to
Cys-186 of Ras. This is catalyzed by farnesyl transferase, which is localized in
the cytoplasm and takes farnesylpyrophosphate as substrate [14]. Second, the three
C-terminal amino acids aax are cleaved off by a prenylprotein-specific
endoprotease [15]. In the third step, the cysteine is carboxymethylated by
carboxymethyl transferase [15]. These two enzymes are located in the endomembrane
system [16, 17].
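
The prenylation rule stated in this paragraph can be written down as a small decision function. The following Python sketch is a simplified illustration under stated assumptions: it looks only at the last four residues of a one-letter sequence, requires a Cys in the first position of the box, and classifies by the final residue x exactly as described in the text (Ser or Met: farnesyl; Leu or Phe: geranylgeranyl). It does not check that the two middle residues are aliphatic, and the function name classify_caax and the example sequences are hypothetical.

```python
FARNESYL_X = {"S", "M"}          # x = Ser or Met, as stated in the text
GERANYLGERANYL_X = {"L", "F"}    # x = Leu or Phe, as stated in the text


def classify_caax(sequence: str) -> str:
    """Classify the expected prenyl group from the C-terminal Caax box
    of a one-letter amino acid sequence."""
    box = sequence.strip().upper()[-4:]
    if len(box) < 4 or box[0] != "C":
        return "no Caax box"
    x = box[3]
    if x in FARNESYL_X:
        return "farnesyl"
    if x in GERANYLGERANYL_X:
        return "geranylgeranyl"
    return "unclassified"


if __name__ == "__main__":
    # Illustrative C-termini only; not taken from the text.
    for tail in ("KCVLS", "KCVIM", "KCLVL", "KAAAA"):
        print(tail, "->", classify_caax(tail))
```

Residues outside the two sets named in the text are reported as unclassified rather than guessed.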

As soon as Ras sticks to the plasma membrane another lipid anchor is attached
to it. A putative palmitoyl transferase, which is assumed to reside in the
plasma membrane, catalyzes the thio-esterification of another cysteine close to
the C-terminus [18]. This process leads to trapping of Ras at the cell membrane
[19]. K-Ras4B, which does not become further modified after carboxymethylation
but which carries a set of six lysines in the hypervariable region,
experiences enhanced binding to the negatively charged plasma membrane.
Recently, a more complex model of Ras membrane targeting was suggested,
involving intermediate localization of farnesylated Ras to the endomembrane
system and vesicular transport of Ras to the plasma membrane [20]. Nevertheless,
two signals are necessary for final docking of Ras to the cell membrane,
i.e., farnesylation and palmitoylation or, alternatively, a polybasic stretch of
amino acids.



1.4.2 

GTPase Cycle 

Ras is a GTPase with a weak intrinsic activity. It binds with similar, high
affinity to both GTP and GDP, with a slow rate of dissociation. Because GTP is
present in high molar excess in the cell, it is predominantly GTP that binds to
Ras after dissociation of GDP. The biological function of Ras is based on the two
different conformations it adopts in the GTP- and GDP-bound states, respectively.
After GTP hydrolysis the release of the phosphate group triggers a conformational
switch of two peptide stretches in Ras which were non-covalently bound to the
γ-phosphate of GTP [21, 22]. As the change between the GTP- and GDP-bound forms
turns the biological activity on and off, respectively, Ras is often referred to
as a molecular switch. In its resting state Ras is bound to GDP, and only the
GTP-bound form is able to bind strongly to effector molecules, which become
activated in order to transmit the signal further in their pathway (see Fig. 3).
According to the following equation, the relative amount of Ras bound to
GTP, and hence its biological activity, depends on the GTP hydrolysis and GDP
dissociation rates, which may be up-regulated as described in the two following
sections:

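\[
\frac{[\mathrm{Ras \cdot GTP}]}{[\mathrm{Ras \cdot GTP}] + [\mathrm{Ras \cdot GDP}]}
  = \frac{k_{\mathrm{diss}}}{k_{\mathrm{diss}} + k_{\mathrm{hyd}}}
\]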
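
To see how the two rate constants set the position of the switch, the steady-state relation can be evaluated numerically. The Python sketch below assumes the simple two-state picture used in the text, in which the GTP-bound fraction equals k_diss / (k_diss + k_hyd); the function name ras_gtp_fraction and all rate constants are hypothetical and were chosen only to show the direction of the effects, not to reproduce measured values.

```python
def ras_gtp_fraction(k_diss: float, k_hyd: float) -> float:
    """Steady-state fraction of Ras in the GTP-bound form, assuming
    fraction = k_diss / (k_diss + k_hyd)."""
    if k_diss < 0 or k_hyd < 0:
        raise ValueError("rate constants must be non-negative")
    if k_diss + k_hyd == 0:
        raise ValueError("at least one rate constant must be positive")
    return k_diss / (k_diss + k_hyd)


# Hypothetical rate constants (arbitrary but consistent units, e.g. 1/s).
BASAL = ras_gtp_fraction(k_diss=1e-4, k_hyd=1e-3)
# Faster GDP dissociation (accelerated exchange) shifts Ras to the "on" state.
WITH_FAST_EXCHANGE = ras_gtp_fraction(k_diss=1e-1, k_hyd=1e-3)
# Faster GTP hydrolysis shifts Ras to the "off" state.
WITH_FAST_HYDROLYSIS = ras_gtp_fraction(k_diss=1e-4, k_hyd=1.0)

if __name__ == "__main__":
    print(f"basal:           {BASAL:.4f}")
    print(f"fast exchange:   {WITH_FAST_EXCHANGE:.4f}")
    print(f"fast hydrolysis: {WITH_FAST_HYDROLYSIS:.6f}")
```

Raising k_diss increases the GTP-bound fraction and raising k_hyd decreases it, which is the behavior expected when either rate is up-regulated.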



1.4.2.1

Exchange Factors 

Ras activation, i. e., exchange of the bound GDP for GTP, is accelerated by guani- 
ne nucleotide exchange factors (GEF) (see Fig. 3). Sos is ubiquitously expressed 
whereas other GEFs are found in some tissues only [7]. Activation of Ras occurs 
after extracellular stimulation of the cell: for instance hormones or growth 
factors bind to their specific receptors at the outer cell membrane, leading to 
tyrosine kinase activity of the transmembrane receptor at the cytoplasmic
side of the membrane. Autophosphorylation of receptor tyrosine residues
leads to docking of the SH2 domain of Grb2 which, in turn, recruits Sos to

Fig. 3. Ras represents a typical GTPase, being active in the GTP-bound state and silent when 
GDP-bound. GTP loading can be accelerated by guanine nucleotide exchange factors (GEF), 
activating Ras, whereas GTP hydrolysis is upregulated by GTPase activating proteins (GAP), 
thereby turning Ras “off”. In the “on”-state Ras can activate effector molecules like Raf 
through binding to the Ras binding domain (RBD) which seems to be common to (most)
effector proteins. The cellular response in the case of Raf activation is - after a cascade of 
phosphorylation reactions - the activation of DNA transcription (TF = transcription factor in 
the nucleus) leading to cell division. Note that Ras is permanently located at the inner plasma 
membrane (not indicated here) 



the membrane by binding with its SH3 domain to proline-rich regions in 
Sos. Three features common to many signaling pathways should be stressed 
here: 

1. Phosphorylation leads to enhanced (or decreased) enzymatic activity or to 
altered binding affinities. 
2. SH2 and SH3 domains refer to Src homology 2 and 3; Src is a non-receptor
tyrosine kinase in which these domains were originally identified. They bind
to phosphorylated tyrosine and proline-rich epitopes, respectively, each with
its own specificity with respect to the neighboring amino acids. Grb2 is an
adapter molecule par excellence, having both SH2 and SH3 domains (see also
the respective article in this volume).

3. The third feature is the recruitment of a molecule, in this case Sos, to the 
membrane in order to fulfill its function. Both increased local concentrations 
at the membrane and two-dimensional diffusion of the collision partners, Sos 
and Ras, lead to enhanced encounter of the two proteins and thus to accele- 
rated nucleotide exchange on Ras. 

Other activation systems have been identified lately, like activation of RasGRP
(guanine nucleotide releasing protein) by diacylglycerol/Ca²⁺ [23, 24]. Also
Cdc25Mm/RasGRF (guanine nucleotide releasing factor) is stimulated by
Ca²⁺-induced binding of calmodulin to its IQ-motif [25].



1.4.2.2 

GTPase Activating Proteins 

Ras returns to its resting state by hydrolysis of its cofactor GTP. As mentioned
above, the GTPase activity of Ras is low but it can be enhanced up to $10^{5}$-fold by
the interaction with a GTPase activating protein (GAP) [26, 27] (see Fig. 3). The
two most investigated GAPs in humans are p120GAP and NF1. Whereas the former
is thought to have an additional effector function [28, 29], the latter is
classified as a tumor suppressor, as disruption of its gene leads to
neurofibromatosis [30]. This fact demonstrates the importance of having a regulatory
machinery that not only activates Ras but which also allows it to be efficient- 
ly turned off. As with GEFs it is probably essential to have GAPs for all small 
GTPases for proper regulation and, in fact, specific GEFs and GAPs are known 
for many other Ras-related proteins. 

The oncogenic potential of Ras is based on a gain of function, i.e., permanent
activation of effectors, which actually is due to a loss of enzymatic activity [31].
Ras' Achilles heel is represented by residues Gly12 and Gln61. When either of
these residues is mutated, Ras turns oncogenic because its GTP hydrolysis
activity is impaired [31]. Moreover, GAPs can no longer act on Ras
[26]. This results in a permanently high concentration of Ras • GTP, leading
to permanent, unregulated activation of the downstream targets and hence
to uncontrolled cell growth. Together with mutations of some other genes like
p53, the cell becomes transformed, which finally results in tumor growth.
Although oncogenic Ras does not cause cell transformation on its own but
only in concert with a few other genetic disruptions [1, 32, 33], Ras is one of the
most frequent oncogenes. It is found in 30% of all human tumors [34], playing
a prominent role, e.g., in colon (50%) and pancreatic cancer (90%), while apparently
being of little importance in, e.g., breast cancer. Designing compounds which
re-establish the GTPase activity of oncogenic Ras and its susceptibility to GAP
represents another approach in the development of anti-cancer drugs directed
against Ras [35, 235].
1.4.3

Effectors 

After Ras has been activated, the signal is further transduced by interaction of 
Ras • GTP with downstream targets. Obviously, Ras may accomplish the diverse 
cellular functions which are known for it by activating various signal branches. 
Each of these pathways experiences additional regulation by inputs from other 
signal molecules so that the actual result of Ras activation depends on the phy- 
siological context of the cell. An intriguing finding was the partial loss of Ras 
function by distinct mutations in its effector region [36-39]. Some residues are 
essential for the interaction with one effector and others are more important for 
the interaction with another effector. Such mutants are useful to study the bio- 
logical effect of suppressing one of the signal branches emerging from Ras. 

Ras effectors are characterized by their nucleotide-dependent interaction 
with Ras, which is impaired by mutations in the effector region of Ras (e.g., 
D38A), and by their competition with GAP in binding to Ras suggesting that 
effectors and GAPs bind to the same or overlapping sites on Ras [26, 40]. In 
recent years many Ras effectors were identified like Raf, RalGDS, PI(3)K, PKCζ,
MEKK, Byr2, AF-6, and Nore [41-54]. There is no sequence homology and no
functional relationship among these effectors. There is no rule either for the
location of the RBD in the effector, which ranges from the N-terminus in AF-6 to the
C-terminus in RalGDS. So far only for three effectors, namely Raf, PI(3)K,
and RalGDS, could the activation of their biological function by Ras be
demonstrated. Figure 4 shows only five out of more than ten putative Ras effectors,
also indicating the following pathways and the cellular response.

[Scheme: Ras (Rap, R-Ras, TC21) activates MEKK (Byr2) and Raf, which signal via SEK/JNK and Mek/Erk, respectively, to transcription factors, leading to cell growth and differentiation]

Fig. 4. Ras (in the GTP state) activates a variety of effectors each with different functions and 
cellular pathways. Only five out of more than ten putative effectors are shown. The signal cas- 
cades following effector triggering are schematically shown which lead to one of the possible 
cellular responses to activated Ras. Due to their high sequence homology, Rap, R-Ras, and
TC21 may also interact with the Ras effectors. Scheme according to [55]
1.4.3.1
Raf 

Adenylate cyclase was identified as the primary Ras target in yeast (Saccharomyces
cerevisiae) [56], but it was not until 1993 that several groups independently
found Raf to be the effector of Ras in mammals [41-44]. Shortly
afterwards it was realized that this is not the only target of Ras, but up until now
it appears to be the most prominent one. Raf is a Ser/Thr-specific protein kinase
which phosphorylates and thereby activates Mek which in turn phosphorylates 
and activates Erk, leading to an amplification of the signal. Erk, also termed 
MAPK, has a plethora of phosphorylation targets, the most important of which 
are transcription factors such as Elk-1, leading to activation of the transcription
machinery in the nucleus. 

The catalytic domain of Raf is located in the C-terminal half whereas the N- 
terminal part is regarded as the regulatory domain [57]. Here two distinct 
domains adjacent to each other were identified, the Ras binding domain (RBD)
comprising 80 amino acids and the zinc ion binding region or cysteine rich 
region (CRR). Much interest has been directed towards the Ras/Raf interaction 
and the mechanism of Raf activation, since this protein couple represents a cru- 
cial hinge in signal transduction and thereby a potential target for anti-cancer 
drugs. Through binding to Ras • GTP, Raf is translocated to the plasma mem- 
brane and there is still debate going on about further activation events on Raf. 
By fusing two membrane translocation signals to Raf, namely the C-terminus of
K-Ras including the poly-lysine stretch and the CAAX box, the biological func- 
tion of Raf could be fully activated, suggesting a mechanism of Raf activation 
simply by membrane recruitment [58, 59]. So far unknown factors also relating 
to the membrane were postulated to ensure sustained activation of Raf. Phosphorylation
of critical Tyr residues in the middle part of the Raf molecule is
discussed as an additional activation step [60, 61], although the isoform B-Raf
does not have these sites. It has been suggested that 14-3-3 proteins play a role in 
activation which may convey allosteric or stability effects onto Raf [62-64] or 
which lead to dimerization and subsequent transphosphorylation of Raf in this 
complex [65, 66]. The role of RBD is obvious, i.e., docking the Raf molecule to 
Ras and thereby to the membrane, whereas the function of CRR in Raf activa- 
tion is not so clear. Contradictory data are published about CRR involvement in 
Ras interaction [67-69], possibly binding to the farnesyl tail [70-72]. Furthermore,
CRR might play a role, due to its hydrophobic character, in membrane
attachment, thereby reinforcing membrane docking [73], or it might have a 
contact with the kinase domain which adopts a different structure when it is 
attached to the membrane. An alanine scanning study identified Raf activating 
and inhibitory mutations and defined distinct epitopes, demonstrating the 
involvement of CRR in both Ras interaction and lipid cofactor binding [74]. 
Observations with the isoform B-Raf, which can be activated by farnesylated
Ras • GTP in a membrane-free assay, again in concert with 14-3-3 protein [75], are in
favor of an allosteric model of Raf activation. Nevertheless, impairment of
Ras/Raf interaction leads to abrogation of growth signaling, suggesting the most 
important signal pathway from Ras effectively to be cut off. 
1.4.3.2

RalGDS

RalGDS is an exchange factor for a close relative of Ras, Ral, and it was identified
as a Ras effector, again independently by a number of research groups [45-47].
With Rlf and Rgl, RalGDS-related proteins were found which are also GEFs
for Ral and bind to Ras via their C-terminal RBD [45, 76]. Elevated levels of
Ras • GTP in the cell lead to increased loading of Ral with GTP [77]. Activated
Ral in turn interacts with RalBP1, which is a GAP specific for Cdc42 and Rac,
thereby linking Ras signaling to the Rho family of small GTPases [78] (see
below). It was suggested before that oncogenic Ras does not achieve full cell
transformation by overactivation of Raf alone [79, 80]. Aberrant function in
pathways regulating cell morphology is also required (see also Fig. 4). Mutants
of Ras were found which block activation of one but not the other effector, the
biological effects of which demonstrated the need for synergizing pathways
[36]. A second function of RalGDS is transcriptional activation through c-fos
[37, 77, 81], the pathway of which is not yet fully understood. Rlf, another Ral-GEF,
is also involved in transcriptional activation [76, 82].

In certain cell types RalGDS acts in opposition to other Ras effectors. It was
shown to suppress neurite outgrowth in PC12 cells, which is induced by
Ras-activated Raf and PI(3)K [83]. This and other examples [84] demonstrate that
the cellular response to Ras and the timing of biological events depend on the 
magnitude and the kinetics of activation of different effectors and their down- 
stream targets which are also subject to the regulation by other signaling mole- 
cules. An example is given by the attenuated activation of Raf in contrast to 
RalGDS when these two effectors are phosphorylated by cAMP-dependent pro- 
tein kinase A [85]. 



1.4.3.3

PI(3)K 

PI(3)K is a well known enzyme involved in the production of second messenger
molecules. The α-, β-, and δ-isoforms in class I of this enzyme are heterodimers
consisting of a regulatory subunit, p85, which contains SH2 and SH3 domains, and
a catalytic subunit, p110. p110γ does not bind to any p85 subunit and therefore
cannot be activated by tyrosine kinase receptors but instead by α- and βγ-subunits
of heterotrimeric G-proteins [86]. Additionally or alternatively, all four
p110 isoforms bind to Ras • GTP [48, 49, 87, 88] and are thereby activated to a
greater or lesser extent [87, 89]. A mechanism for this could again be simply recruitment to the
membrane where the major substrates for PI(3)K are located. p110 has not only
the ability to phosphorylate phosphatidyl inositols but also proteins. PI(3)K is
therefore involved in diverse cellular processes and Ras in particular seems to 
link it to the regulation of cytoskeletal rearrangements and cell division, the 
molecular pathways of which are not precisely traced so far. Targets for the 
second messengers produced by PI(3)K are parts from different signaling 
pathways like the Ser/Thr kinase Akt/PKB, Rac, and p70-S6 [90-95]. The bio- 
logical consequences of PI(3)K activity appeared to be even more complex after 
finding that PI(3)K can also act upstream of Ras [96]. Finally, it is worthwhile to
point out that the Ras/PI(3)K signal branch via Akt and Bad is involved in the 
control of cell survival, yet another factor important in cancer [97] (see also 
Fig. 4). 



1.5 

Relatives of Ras 

Ras is the prototype of small GTPases but there are many other members in this 
protein superfamily specializing in distinct biological functions. The classifica- 
tion into subfamilies according to sequence homology coincides - not surpris- 
ingly - with functional relationships. While little is known about Rad proteins, 
the biological functions of members in the Rho, Rab, Arf, and Ran subfamilies 
shall be described briefly in the next sections but one. Yet before these cousins 
are reviewed, the brothers and sisters of Ras will be introduced in the following 
section. 



1.5.1 

Other Members of the Ras Subfamily 

According to their high homology in primary sequence, Rap1/2 and R-Ras/TC21
are members of the Ras subfamily. In particular, they are characterized by their
almost identical effector region (residues 32-40 [98]) where only Ral, also
belonging to the Ras subfamily, shows significant deviations [11]. It is therefore
not surprising that all these proteins except Ral can bind to the effectors
described above, but the biological consequences of these interactions are very
different. Concerning Ral, residues 36 and 37 in Ras-like proteins, respectively,
were identified as tree determinants with respect to effector specificity [99].

Rap was discovered as a gene that was able to revert the phenotype of cells 
transformed by K-Ras and was therefore originally termed K-rev [100]. Based 
on this observation and further evidence it was assumed that Rap acts as anta- 
gonist to Ras by sequestering Ras effectors in unproductive complexes. This 
view has changed in recent years to attributing a more active role to Rap. In fact,
Rap can activate B-Raf [101, 102] and Rlf [103], demonstrating that Rap uses
bona fide Ras effectors but leads to different biological effects (for review see
[104, 105]). This could be explained by the different cellular localization of Ras
and Rap or the occurrence of Rap in special cell types. In case the hydrophobic
tail is involved in effector activation, it is interesting to note that Rap is geranyl- 
geranylated. 

In contrast to the effectors which seem to be shared by the Ras subfamily 
members, specific GEFs and GAPs were identified for Rap. Whereas Ras is pre- 
dominantly activated by a cell membrane-translocated GEF, many GEFs known 
for Rap are directly activated by second messenger molecules like cAMP [106], 
diacylglycerol and calcium ions [105, 107-109]. This may again have something
to do with the localization of Rap at intracellular membranous compartments. 
Certainly the GAPs [110] of Rap play an important role in regulation as
well since the intrinsic GTPase activity of Rap is ten times lower compared
to Ras.

Ral has attracted much interest in recent years, not least because it was 
demonstrated to mediate part of Ras function as described above. In contrast to 
Rap, which rather inhibits Ras signaling, Ral is part of one of the essential Ras- 
activated pathways. Moreover, it has proved to be acting in parallel with the Raf 
pathway in cell transformation induced by oncogenic Ras [37, 77]. The case 
of Ral demonstrates the complexity - and the incomplete knowledge and 
understanding - of signal transduction. Ral can also be activated by Rap 
mediated by Rlf [103] and, alternatively, by binding of a calcium/calmodulin
complex to the Ral C-terminus which obviously does not affect the nucleotide 
state of Ral [111]. 

Generally, the presence of several GEFs and GAPs for each GTPase introduces 
a lot of variability and complexity and it is not easy to recognize physiologically
significant partnerships. R-Ras1 and 2, the latter also termed TC21, seem to
share Ras effectors and to activate the corresponding pathways, not all of them 
and not to the same extent though. Both proteins were found to lead to cell trans- 
formation when their GTPase activity is deregulated, suggesting oncogenic 
potential at least in certain cell types [112-115]. The product of the proto-oncogene
bcl-2, which is involved in control of apoptosis, binds to the C-terminus
of R-Ras [116] but the mechanism of regulation is not known. For the sake
of completeness the other members of the Ras subfamily, Rheb, M-Ras, and
Rit/Rin, should be mentioned here but little is known about their function 
[117-119]. 

1.5.2 

Rho Subfamily 

Of the eight subfamilies of small GTPases, the Rho family is most closely related to
Ras from the functional point of view and shall be described here briefly. For all
three members in this subfamily, Rho, Rac, and Cdc42, particular functions in
cytoskeletal organization are established [120]. Rho activation leads to
adhesion and the formation of stress fibers, Rac is responsible for membrane
ruffling, and Cdc42 is involved in filopodia formation. As described above for
Ras, each Rho protein is activated and deactivated by specific GEFs and GAPs, 
respectively [121]. With the GDIs, a third class of regulators is found for Rho 
proteins. Recently, the structural basis was elucidated for their function in 
inhibition of both guanine nucleotide dissociation and GTPase activity [122]. 
Specific sets of effector proteins are known for all three family members. 
Through these and in part unknown proteins, the signals are mediated to 
F-actin or actin-binding proteins, which results in distinct processes of cyto- 
skeletal rearrangement. In addition, and in analogy to Ras, MAPK modules 
are turned on by activated Rho proteins [123-125]. These kinase cascades lead
to transcriptional activation which results here in morphological alterations 
of the cell. 

A functional connection between the Rho proteins has been observed. Cdc42
may lead to the activation of Rac which for its part can activate Rho [126]. The
communication between different pathways, often referred to as cross talk, is
characteristic for signal transduction but for many pathways only vaguely 
described or postulated. Nevertheless, there are also links between Ras and 
Rho/Rac pathways and their synergism is essential for complete cell transformation
[79, 80, 127].



1.5.3 

Rab and Arf Subfamilies 

There are more than 40 different Rab proteins known so far and the size of this 
subfamily is still growing [128]. They are involved in direction of vesicular 
transport between subcellular compartments like endosomes, endoplasmic reti- 
culum, Golgi apparatus, etc. [129, 130]. In their GDP-bound form the prenylated 
Rab proteins stick to the donor membrane where GEF mediated exchange for 
GTP occurs. Bound to the detached vesicle Rab • GTP travels to the acceptor 
membrane and after fusion of the vesicle with this membrane GTP hydrolysis is 
enhanced by acceptor membrane-localized GAP. The way back to the donor
membrane through the cytoplasm is facilitated by a GDI protein which accompanies Rab and
increases the solubility of Rab • GDP by masking the hydrophobic prenyl group.
For a more detailed description of the role of Rab in target membrane recognition
and the involvement of SNARE proteins in membrane fusion, the reader is referred
to reviews in this field [130-132]. Members of the Arf subfamily play a role
in vesicle transport as well [133]. A major difference in their function is that they
recruit coating proteins onto the trafficking vesicle which dissociate after GTP 
hydrolysis. 



1.5.4 

Ran 

Ran (the Ras-related nuclear protein) is the major regulator of nucleo-cytoplas- 
mic transport [134] across the nuclear pore complex (NPC). Like other small 
Ras-like GTP-binding proteins, it switches between a GTP- and a GDP-bound 
form by GTP-hydrolysis and nucleotide exchange [135]. In contrast to its rela- 
tives, Ran does not undergo posttranslational modification. 

The exclusive nuclear localization of the Ran exchange factor RCC1 [136]
ensures that the generation of Ran • GTP is confined to the nucleus. The conversion
of Ran • GTP into Ran • GDP is catalyzed by the GTPase activating protein
RanGAP1 [137], which is present only in the cytoplasm or at the cytoplasmic side
of the nuclear pore, efficiently depleting Ran • GTP from the cytoplasm (Fig. 5).
This differential localization of the regulators of Ran’s nucleotide-bound state 
should thus result in a Ran • GTP-gradient across the NPC which is believed to 
drive the import-export cycle [138]. 

Ran interacts with transport receptors which load and unload their cargo in 
the respective compartment depending on the nucleotide state of Ran [138]. An 
importin binds its cargo initially in the cytoplasm, gets translocated through the 
NPC, releases the cargo upon binding Ran • GTP in the nucleus [139-141], and
returns to the cytoplasm as a Ran • GTP complex. The removal of Ran • GTP 
Fig. 5. The cellular compartment defines the nucleotide state of Ran. In the cytosol a
Ran-specific GTPase activating protein (RanGAP) ensures that Ran exists almost exclusively in
the GDP-complexed state. In the nucleus a Ran nucleotide exchange factor (RCC1) catalyzes
the exchange reaction from Ran • GDP towards Ran • GTP. Import and export complexes
recognize their actual cellular environment by interaction with the Ran protein



from the importin involves the hydrolysis of the Ran-bound GTP and allows the 
importin to bind and import the next cargo molecule. 

Binding of substrates to exportins is regulated in a converse manner 
to importins. Exportins bind their cargoes preferentially in the nucleus, form- 
ing a trimeric cargo • exportin • Ran • GTP complex [142]. This trimeric com- 
plex is then transferred to the cytoplasm where Ran • GTP is converted to 
Ran • GDP. This results in Ran’s dissociation from the complex, allowing the 
exportin to release its substrate, re-enter the nucleus, and to start the next export 
cycle. 

The recycling process of Ran between its GDP and GTP complexed state is 
supported by another class of Ran-binding proteins, which support GTP-hydro- 
lysis by RanGAP and are either soluble proteins [143] or part of the nuclear pore 
complex [144]. 

Sequences of Ran and Ran-interacting proteins are highly conserved between 
different species and allow interspecies exchange of GEFs and GAPs from, e.g., 
yeast to mammals. Disturbing the Ran-system causes analogous effects on the
organization of the nuclear skeleton, RNA-processing and transport, protein 
import, and cell-cycle control [145]. 
2

Biophysical Methods 

Biophysical analysis of biomolecules like proteins, nucleic acids, or lipids utilizes 
intrinsic physical properties of the observed molecule itself or of an associated 
reporter molecule, which reflect information about structural characteristics, 
interactions, or reactions of the subject observed. In most cases the analysis 
(and the labels introduced) only interferes slightly with the interaction of 
interest and does not induce significant changes in the properties of the reac- 
tants. 



2.1 

Readout for Biological Macromolecules 

Information about the state of an interaction can be achieved by a multitude of 
different readouts. Most of them are based on spectroscopical properties of the 
biomolecules or the labels introduced and allow real time analysis of the reac- 
tions. In some cases samples have to be retained and the analysis has to be per- 
formed subsequently. 

2.1.1

Microscopic Properties and Macroscopic Signals

The main interest of biophysical analysis is to get information about function 
and mechanism of interactions between biomolecules. This is reflected by the 
typical way of presenting models for biological systems. Individual molecules 
bind to each other, catalyze an enzymatic reaction, or exhibit special structural 
features. The idea about the system is presented on a microscopic level. 

Nevertheless, most of the biophysical methods do not observe single molecules
but a huge number of players in a concentration range between about $10^{14}$ and
$10^{21}$ molecules/l (corresponding to 100 pmol/l to 1 mmol/l, which are reasonable
concentrations in cellular systems). While the dynamics of the single reactants
is stochastic, the macroscopic readout of the measuring system can normally be
described with ordinary differential equations.
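The conversion between molar concentrations and particle numbers can be checked with a minimal sketch. The function names and the sample volumes (a 1 ml cuvette and a 1 pl compartment of roughly cellular size) are assumptions introduced here for illustration only.

```python
AVOGADRO = 6.02214076e23  # particles per mole (exact SI value)


def molecules_per_liter(molar: float) -> float:
    """Convert a concentration in mol/l into molecules per liter."""
    return molar * AVOGADRO


def molecules_in_volume(molar: float, volume_liters: float) -> float:
    """Number of molecules contained in a given sample volume."""
    return molecules_per_liter(molar) * volume_liters


# Concentration limits quoted in the text.
low, high = 100e-12, 1e-3  # mol/l

print(f"100 pmol/l = {molecules_per_liter(low):.2e} molecules/l")
print(f"1 mmol/l   = {molecules_per_liter(high):.2e} molecules/l")

# Hypothetical sample volumes, chosen only for illustration.
CUVETTE_L = 1e-3       # 1 ml
COMPARTMENT_L = 1e-12  # 1 pl
print(f"molecules in 1 ml at 100 pmol/l: {molecules_in_volume(low, CUVETTE_L):.2e}")
print(f"molecules in 1 pl at 100 pmol/l: {molecules_in_volume(low, COMPARTMENT_L):.1f}")
```

In a macroscopic sample even the lowest concentration corresponds to a very large number of molecules, which is why an averaged, deterministic description is usually adequate, whereas in a very small volume the particle number can become small enough for stochastic effects to matter.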



2.1.2 

Radioisotopes 

Even nowadays the application of radioactive isotopes is the most sensitive
method for the analysis of biomolecules or their reaction products. Besides
the low detection limits, the replacement of a naturally predominant stable
isotope by its radioactive analogue does not interfere with the physical or
chemical properties of the labeled molecule (with some exceptions for hydrogen).
Figure 6 lists some frequently used radioactive isotopes and their half-life
periods.
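For planning experiments with short-lived isotopes it is useful to estimate how much activity remains after a given time. The following sketch applies the standard exponential decay law; the function names are hypothetical, and the half-lives are those quoted for the phosphorus and sulfur isotopes (14.3 d and 87.1 d).

```python
import math


def decay_constant(half_life: float) -> float:
    """Decay constant lambda = ln(2) / t_half (inverse unit of half_life)."""
    return math.log(2) / half_life


def fraction_remaining(elapsed: float, half_life: float) -> float:
    """Fraction of the initial activity left after `elapsed` time.

    `elapsed` and `half_life` must be given in the same unit.
    """
    return math.exp(-decay_constant(half_life) * elapsed)


HALF_LIFE_P32_DAYS = 14.3
HALF_LIFE_S35_DAYS = 87.1

for days in (7, 14.3, 30):
    p32 = fraction_remaining(days, HALF_LIFE_P32_DAYS)
    s35 = fraction_remaining(days, HALF_LIFE_S35_DAYS)
    print(f"after {days:>5} d: P-32 {p32:.1%}, S-35 {s35:.1%}")
```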






J. Kuhlmann • C. Herrmann 



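$^{3}_{1}\mathrm{H} \rightarrow {}^{3}_{2}\mathrm{He} + \beta^{-} + h\nu, \quad t_{1/2} = 12.43\ \mathrm{a}$
$^{32}_{15}\mathrm{P} \rightarrow {}^{32}_{16}\mathrm{S} + \beta^{-} + h\nu, \quad t_{1/2} = 14.3\ \mathrm{d}$
$^{35}_{16}\mathrm{S} \rightarrow {}^{35}_{17}\mathrm{Cl} + \beta^{-} + h\nu, \quad t_{1/2} = 87.1\ \mathrm{d}$
$^{14}_{6}\mathrm{C} \rightarrow {}^{14}_{7}\mathrm{N} + \beta^{-} + h\nu, \quad t_{1/2} = 5756\ \mathrm{a}$

Fig. 6. Some frequently used radioactive nuclides in biochemistry. All nuclides presented
here are β-emitters with half-life periods from days (d) to thousands of years (a)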



Concentrations of radiolabeled proteins, substrates, or products can be quantified
by scintillation counters, which detect both emitters of weak (e.g., ³H) and
high energy (e.g., ³²P) by excitation of an organic solvent (e.g., toluene) which
then emits fluorescence light. In commercial systems the primary fluorescence
is transformed via one or two additional fluorescent dyes in the solution into a
visible emission signal which can easily be detected by conventional photomultipliers.

A spatially resolved detection of radioactive samples after chromatographic
separation can be performed by imaging techniques which work either indirectly
with Eu²⁺- or P-doped sensor plates and laser activated emission or directly
by a microchannel array detector which works like an open Geiger-Müller
counter.



2.1.3 

Staining Techniques 

Direct staining of proteins (e.g., after electrophoretic separation in polyacryla- 
mide gels) can be achieved by treatment with dyes like Coomassie Brilliant Blue 
R-250 [146] (Fig. 7), which binds positively charged proteins in an acidic fixation
buffer, allowing detection down to 0.1 µg of protein.

An increase in sensitivity is realized by silver-staining, where residues con- 
taining sulfur (cysteine, methionine) or basic side chains (arginine, lysine, 
histidine) reduce Ag"^, leading to brown or black colored bands. Here, down to 
0.1 ng of protein can be detected. 

For quantitative analysis of protein concentration the colorimetric Bradford
assay [147] is most commonly used. Here another Coomassie dye, Brilliant Blue
G-250, binds in acidic solutions to basic and aromatic side chains of proteins. 
Binding is detected via a shift in the absorption maximum of the dye from 
465 nm to 595 nm. Calibration is mostly performed with standard proteins like
bovine serum albumin (BSA). Due to the varying contents of basic and aromatic
side chains in proteins, systematic errors in the quantification of proteins
may occur.
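Calibration against a standard protein can be sketched as a straight-line fit of absorbance at 595 nm versus standard concentration, from which an unknown sample is read back. The sketch assumes that the assay is used within its linear range; the function names and all numerical values are hypothetical and do not represent measured data.

```python
def fit_line(x, y):
    """Ordinary least-squares fit y = slope * x + intercept."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def concentration_from_absorbance(a595, slope, intercept):
    """Invert the calibration line to obtain the protein concentration."""
    return (a595 - intercept) / slope


# Hypothetical BSA standards (mg/ml) and their absorbance at 595 nm.
bsa_standards = [0.0, 0.25, 0.5, 0.75, 1.0]
a595_standards = [0.05, 0.20, 0.35, 0.50, 0.65]

slope, intercept = fit_line(bsa_standards, a595_standards)
unknown = concentration_from_absorbance(0.41, slope, intercept)
print(f"slope = {slope:.3f} per (mg/ml), intercept = {intercept:.3f}")
print(f"unknown sample: {unknown:.2f} mg/ml (in BSA equivalents)")
```

The result is expressed in BSA equivalents, which reflects the systematic error mentioned above: a protein with a different content of basic and aromatic side chains gives a different color yield per unit mass.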

2.1.4 

Enzymatic Detection 

Enzymatic reactions for the direct quantification of low molecular weight substrates
and products are well established in clinical chemistry or nutrient analytics.
For proteins, enzymatic methods enhance the sensitivity of detection and
lower the limits of quantification by orders of magnitude.

Fig. 7. Coomassie Brilliant Blue R-250 (I) is used to stain proteins, e.g., after
gel-electrophoretic separation; its derivative G-250 (II) is applied in the Bradford
assay for protein quantification

A classical approach is the enzyme linked immunosorbent assay (ELISA), 
where the antigen (e.g., the protein to be quantified) is immobilized on the sur- 
face of a well. A first antigen-specific antibody is applied to occupy all antigens, 
before a second antibody binds all primary antibodies on the well. The second 
antibody carries an enzyme, which now catalyzes a color reaction. If the sub- 
strate of the enzyme is given in high excess, the enzyme is saturated and the pro- 
duction of product is linear with time and concentration of second antibody and 
antigen (Fig. 8). 
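Why a large excess of substrate makes the signal linear can be illustrated with the Michaelis-Menten rate law, which is assumed here as a simple model for the reporter enzyme and is not taken from the text. The function names and all parameter values are hypothetical.

```python
def michaelis_menten_rate(substrate, v_max, k_m):
    """Reaction rate v = v_max * [S] / (K_m + [S])."""
    return v_max * substrate / (k_m + substrate)


def product_at_saturation(enzyme, k_cat, seconds):
    """Product formed by a saturated enzyme: [P] = k_cat * [E] * t."""
    return k_cat * enzyme * seconds


# Hypothetical kinetic parameters, chosen only for illustration.
K_M = 1e-4    # mol/l
V_MAX = 1e-6  # mol/(l s)

for substrate in (1e-5, 1e-4, 1e-2, 1e-1):
    relative = michaelis_menten_rate(substrate, V_MAX, K_M) / V_MAX
    print(f"[S] = {substrate:.0e} mol/l -> v / v_max = {relative:.3f}")

# At saturation the product doubles when the amount of bound enzyme
# (and hence of second antibody and antigen) doubles.
single = product_at_saturation(enzyme=1e-9, k_cat=100.0, seconds=60.0)
double = product_at_saturation(enzyme=2e-9, k_cat=100.0, seconds=60.0)
print(f"product after 60 s: {single:.1e} vs {double:.1e} mol/l")
```

Once the substrate concentration is far above K_m, the rate no longer depends on the substrate and the color development reports only on the amount of enzyme bound to the well.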

There exists a wide variety in the setup of ELISA assays (direct binding or 
competition setups) and the enzymatic reaction utilized [148]. A similar prin- 
ciple to enhance sensitivity by enzymatic coupling is realized after gel electro- 
phoretic separation of proteins. Here proteins are transferred to nitrocellulose 
(“western blot”) and detected by antibody-coupled enzymes. 
[Scheme with panels I, II, and III]

Fig. 8. Schematic presentation of an enzyme-linked immunosorbent assay (ELISA). An antigen
(•) is immobilized on the surface of a microtiter plate and incubated with its antibody (ab1).
A second antibody (ab2) with a covalently linked enzyme (★, e.g., horseradish peroxidase)
binds to the primary one, and its enzyme catalyzes a color reaction. All incubations are
separated by washing steps



2.1.5 

UV/Vis Spectroscopy 

There are two principal ways for optical detection of protein concentrations:
either the macromolecule or its label emits energy (after excitation by light),
in which case a fluorescence signal can be measured; or it absorbs energy from
electromagnetic waves passing the sample, in which case the optical absorption of the
sample can be measured by UV/Vis spectroscopy and concentrations can be calculated
according to the Lambert-Beer law.

The concentration range is strongly limited by the logarithmic
relationship between transmission and concentration (optical densities that can
reasonably be measured range from 0.1 to 1.5). Nevertheless, protein quantification
by direct UV measurement or after staining with dyes in the visible range is a
very robust method and is found, e.g., as a common detection mode in
HPLC and other chromatographic techniques.
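
The Lambert-Beer relationship and the restricted measuring range can be illustrated with a short calculation. The following Python sketch is not part of the original text: the function names and the extinction coefficient of 20000 1/(M cm) are hypothetical and chosen only for illustration, while the window of 0.1 to 1.5 optical density units is the one quoted above.

```python
def concentration_from_absorbance(absorbance, epsilon, path_cm=1.0,
                                  valid_range=(0.1, 1.5)):
    """Return the molar concentration c = A / (epsilon * d).

    absorbance : measured optical density (dimensionless)
    epsilon    : molar extinction coefficient in 1/(M*cm), assumed to be known
    path_cm    : optical path length of the cuvette in cm
    Raises ValueError if the reading lies outside the reliable range.
    """
    low, high = valid_range
    if not low <= absorbance <= high:
        raise ValueError("absorbance outside the reliably measurable range")
    return absorbance / (epsilon * path_cm)


def transmission(absorbance):
    """Fraction of light transmitted, T = 10**(-A)."""
    return 10.0 ** (-absorbance)


if __name__ == "__main__":
    # Hypothetical protein with epsilon(280 nm) = 20000 1/(M*cm)
    c = concentration_from_absorbance(0.5, 20000.0)
    print(f"c = {c * 1e6:.1f} uM, transmitted fraction = {transmission(0.5):.3f}")
```

Because the transmitted fraction falls by a factor of ten for every absorbance unit, readings far above 1.5 rest on very little transmitted light, which is one way to see why the usable range is narrow.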



2.1.6 

Fluorescence-Based Assays 

The most populated energy state of chemical species at room temperature is the 
ground state. Once a molecule has absorbed energy in the form of electro- 
magnetic radiation, it returns to the ground state, which can occur via several 
routes, some of which are shown in the Jablonski diagram (Fig. 9). 

Fluorescence is possible if the absorption of light (A) leaves the excited molecule
in a higher vibrational state than it occupied in its ground state. Internal conversion
(IC) by vibrational relaxation leads to a lower energy (or longer wavelength) of
the emitted light (F) when the molecule returns to the electronic ground state.
Lifetimes of fluorescent states are very short (10^^ s to 10 ® s). This process is in 
competition with vibrational relaxation, which usually occurs much faster and 
is enhanced by physical contact of an excited molecule with other particles. 
Therefore large portions of energy can be transferred through collisions. 

Fig. 9. Jablonski diagram for energy conversion pathways of an excited molecule. While fluorescence
occurs between states of the same spin, an ISC (intersystem crossing) leads to spin
inversion and a delay in emission (phosphorescence halftimes from 10“^ s to minutes or even 
hours) 



Because excitation and emission are separated in a fluorescence
spectrometer, concentrations can be detected down to the picomolar range, with a
linear range spanning up to five orders of magnitude. As a consequence of, e.g., vibrational
relaxation, the fraction of the absorbed energy which is released as fluorescence (quantum
yield) depends strongly on solvent and temperature.

A special option for measurements is fluorescence polarization.
Because a chromophore absorbs light with high probability only if
its transition dipole moment is parallel to the electric field vector of the incident
light, polarized light excites only those molecules which have the corresponding
orientation. During the lifetime of the excited state the molecule
may change its orientation by rotation. The emitted fluorescence is again
polarized, with an axis parallel to the transition dipole moment of the chromophore.

If a sample is excited with polarized light and emission is measured through
a second set of polarizers oriented parallel and perpendicular to the first polarizer, the
ratio of the two emission signals reflects the rotational freedom of the chromophore.
In practice, binding of a second molecule to the labeled one can be detected
if the size of the chromophore complex increases considerably. The
advantage of this method is that no change in quantum yield is necessary for
the observation of the binding reaction.
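
The two emission signals are commonly combined into the polarization P or the anisotropy r. The following Python sketch assumes the standard textbook definitions and an instrument correction factor G, which is set to 1 here as a simplifying assumption. The function names and intensity values are invented for illustration.

```python
def polarization(i_par, i_perp, g=1.0):
    """P = (I_par - G*I_perp) / (I_par + G*I_perp)."""
    return (i_par - g * i_perp) / (i_par + g * i_perp)


def anisotropy(i_par, i_perp, g=1.0):
    """r = (I_par - G*I_perp) / (I_par + 2*G*I_perp)."""
    return (i_par - g * i_perp) / (i_par + 2.0 * g * i_perp)


if __name__ == "__main__":
    # Invented intensities: a freely rotating label depolarizes the emission
    # more strongly than the same label bound to a large partner.
    for name, i_par, i_perp in [("free label", 1.10, 1.00),
                                ("bound label", 1.60, 1.00)]:
        print(f"{name}: P = {polarization(i_par, i_perp):.3f}, "
              f"r = {anisotropy(i_par, i_perp):.3f}")
```

Both quantities are ratios of intensities, so they do not depend on the absolute signal level. This is consistent with the statement above that no change in quantum yield is required.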

2.1.7

Surface Plasmon Resonance 

The introduction of optical biosensors has made it possible to obtain data for a 
large number of macromolecular interactions without the necessity of addi- 
tional labeling. Here several commercial instruments utilize the effect of Surface 
Plasmon Resonance (SPR) to detect accumulation of ligands in the sensor 
matrix. 

In an SPR system a beam of polarized light is sent through a medium of
higher refractive index towards a second medium with lower refractive index. If
the incident light strikes the interface between the two media at an angle beyond the
critical angle, the beam is totally reflected, while an evanescent wave propagates
from the interface into the lower refractive index medium, decaying within a
distance of approximately one wavelength from the interface.

The phenomenon of surface plasmon resonance is observed at thin metal
films inserted between two phases of different refractive index. The free
electron clouds (plasmons) of the metal can then interact with the evanescent
wave at specific angles of the incident beam. These angles depend on the
ratio of the refractive indices of the two media, and the resonance can be monitored
as a reduced intensity of the reflected beam. In the BIAcore system the high refractive
index medium is a glass prism coupled mechanically to the glass surface
of the sensor chip (Fig. 10). The chip itself is covered with a 50-nm gold layer,
which can be modified with dextran hydrogels or long-chain alkanethiol molecules
in order to generate artificial surfaces [149].

In general the first reactant is immobilized on the surface, while the second
one is applied in buffer solution. Enrichment of macromolecules (e.g., proteins)
on the aqueous side of the sensor leads to an increase in the refractive index of the
corresponding phase and thereby to a change in resonance angle, which can be
monitored directly in a time-resolved manner.



2.1.8 

Circular Dichroism 

Optically active chromophores absorb left and right circularly polarized light
(where the orientation of the polarization changes periodically) to different
extents. Such substances modify a linearly polarized beam, which can be regarded as a
superposition of left and right circularly polarized components, in such a way
that the light is elliptically polarized after leaving the sample, an effect called
circular dichroism.

In proteins the peptide bonds in particular contribute to the CD spectra of
the macromolecule. Here, CD spectra reflect the secondary structure of proteins.
The secondary structure content is derived from CD spectra of model macromolecules with only
one defined secondary structure (like poly-L-lysine at given pH values) or from
spectra of proteins with known structures (e.g., from X-ray crystallography).
The amount of α-helices or β-sheets in the unknown structure is calculated by
linear combination of the reference spectra [150, 151].
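
The linear combination of reference spectra can be sketched as a least-squares problem. The Python example below is an illustration only and does not reproduce the procedures of the cited references: the function name is hypothetical, the reference values are invented numbers rather than real CD data, and clipping negative coefficients is a simplifying assumption.

```python
import numpy as np


def secondary_structure_fractions(spectrum, references):
    """Fit a CD spectrum as a linear combination of reference spectra.

    spectrum   : array of shape (n_wavelengths,)
    references : array of shape (n_wavelengths, n_components); each column is
                 the spectrum of one pure secondary-structure type
    Returns the fitted fractions, normalized to sum to 1.
    """
    coeffs, *_ = np.linalg.lstsq(references, spectrum, rcond=None)
    coeffs = np.clip(coeffs, 0.0, None)  # negative fractions are unphysical
    return coeffs / coeffs.sum()


if __name__ == "__main__":
    # Invented reference values at five wavelengths (columns: helix, sheet, coil)
    refs = np.array([[70.0, 30.0, -40.0],
                     [-30.0, -5.0, -10.0],
                     [-36.0, -18.0, 2.0],
                     [-33.0, -12.0, 4.0],
                     [-5.0, -2.0, 1.0]])
    measured = refs @ np.array([0.6, 0.1, 0.3])  # synthetic "unknown" protein
    print(secondary_structure_fractions(measured, refs))
```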

Fig. 10. A SPR detection as realized in a BIAcore system. A fan of polarized light passes a prism
and is focused at the interface to an aqueous phase under conditions of total reflection. An
evanescent wave enters the solvent phase. If the prism is coated with a thin gold layer at the
interface, the free electrons in the metal absorb energy from the evanescent wave at a distinct
angle, depending on the refractive index of the solvent near the interface. B The gold layer can
be modified with, e.g., a carboxydextran matrix, in which catcher molecules can be immobilized
by standard chemistry. If a ligand is applied with the aqueous phase it may interact with
the catcher and accumulate in the matrix, causing a shift in the resonance angle. If no specific
binding occurs the refractive index in proximity to the sensor is less affected



2.2 

Equilibria 



2.2.1 

Pull Down and Filter Binding Assays 

A simple method to analyze binding equilibria with slow association and dissociation
kinetics is the precipitation of one binding partner with an antibody or
a specific affinity tag. The amount of free and complexed macromolecules can
then be analyzed, e.g., after SDS-PAGE separation with radioactive labels or
staining techniques.

A related approach is realized in filter binding assays. Here the reaction solution
is filtered, e.g., through nitrocellulose, to which proteins are adsorbed while
small molecules pass through. One example of this technique is the quantification of
protein-bound and free nucleotides (with radioactively labeled ligands).

Because the precipitation (or affinity binding step) perturbs the equilibrium 
of interest it has to be ensured that removal of one reactant is much faster than 
the dissociation or association of the binding partners. 



2.2.2 

Fluorescence Titration 



Measurements of binding curves without influencing the equilibria can be performed
if the readout for complex formation is correlated with a change in a
macroscopic signal. This can be a change in fluorescence intensity, fluorescence
polarization, optical absorption, or heat of association (see Sect. 2.2.3).

Assume an equilibrium

$$A + B \rightleftharpoons AB$$

If the concentration of B is kept constant, then [B]max represents the maximum number
of binding sites available. During the experiment increasing amounts of A
are added, occupying more and more of B and increasing the concentration of
AB.

At equilibrium there exist free A (concentration [A]), free B ([B]), and complex
AB ([AB]), where the dissociation constant describes the ratio:

$$K_D = \frac{[A] \times [B]}{[AB]}$$



After replacement of [B] by [B]max - [AB] this equation can be rearranged to

$$\frac{[AB]}{[A]} = \frac{[B]_{\max}}{K_D} - \frac{1}{K_D} \times [AB]$$



A plot of [AB]/[A] vs [AB] (Scatchard plot, Fig. 11B) should therefore result in a
straight line with the slope -1/KD and the intercept [B]max/KD [152].

Fig. 11. A Theoretical binding curve for an A + B = AB interaction. Black lines indicate a plot
of signal vs [A]total, grey lines vs [A]free. The dashed line shows an example of an active site
titration. B Scatchard plot [154]

Figure 11A shows a theoretical example of a titration curve A + B = AB,
where the signal is proportional to the amount of complex. The solid lines represent
conditions where [B]max is equal to KD. Here, for both presentations of signal
vs either [A]total (total concentration of A added to the preparation) or [A]free
(concentration of non-complexed A in the solution, calculated as [A]total - [AB]),
the plot is curved and allows discrimination between free and complexed binding
partners. If [B]max is substantially higher than KD, the issue of active site
titration becomes relevant. In Fig. 11A the dashed line presents a plot of signal
vs [A]total for [B]max = 100 × KD. Under these conditions ligand A is complexed
almost quantitatively by B, resulting in a straight line until almost all B is saturated
by A. Although this titration has to follow the equations above, small errors
in the determination of [A], [B], or [AB] make it impossible to quantify KD reliably.
On the other hand, this kind of binding experiment is often used to determine
the number of binding sites or, alternatively, the fraction of active protein.
Alternative treatments of titration data are described elsewhere, e.g., [153].
For reasonable conditions [B] should be below the expected KD value, and
[A] should span a range from 0.1 to 10 times KD. Analysis of real binding curves
has to consider additional sources of error such as equilibrium not being
achieved, errors in the determination of the concentrations, or non-specific
binding.
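
The relation between a titration curve and its Scatchard representation can be checked numerically. The following Python sketch is an illustration under stated assumptions: it uses ideal, noise-free data, the function names are hypothetical, and the concentrations (arbitrary but consistent units) are invented. The complex concentration is obtained from the exact quadratic solution of the mass-action equation for A + B = AB.

```python
import numpy as np


def complex_concentration(a_total, b_max, kd):
    """[AB] at equilibrium from K_D = ([A]total - [AB]) * ([B]max - [AB]) / [AB]."""
    s = a_total + b_max + kd
    return (s - np.sqrt(s * s - 4.0 * a_total * b_max)) / 2.0


def scatchard_fit(a_total, ab):
    """Fit [AB]/[A]free vs [AB]; slope = -1/K_D, intercept = [B]max/K_D."""
    a_free = a_total - ab
    slope, intercept = np.polyfit(ab, ab / a_free, 1)
    kd = -1.0 / slope
    b_max = intercept * kd
    return kd, b_max


if __name__ == "__main__":
    kd_true, b_max_true = 1.0, 1.0            # invented: [B]max equal to K_D
    a_total = np.linspace(0.1, 10.0, 25)      # spans 0.1 to 10 times K_D
    ab = complex_concentration(a_total, b_max_true, kd_true)
    print(scatchard_fit(a_total, ab))
```

With real data the concentration errors mentioned above propagate into both axes of the Scatchard plot, so a direct non-linear fit of the binding curve is often the more robust choice.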

Complex binding reactions (more than one binding site for A on B, cooperative
binding, etc.) can be described, e.g., by the Hill plot [154].



2.2.3 

Isothermal Titration Calorimetry 

Many of the methods described above may suffer from the modifications of the
proteins which are necessary to obtain a detectable readout for the interaction
under investigation. Fluorescent labels, for instance, have to be attached in close
vicinity to the site of interaction, as otherwise binding to a partner is not likely
to be detectable. It is conceivable that such labels alter the binding
properties. Similar reservations hold for immobilizing a protein.
Labeling with radioactive isotopes is non-invasive, but these methods
have other disadvantages.

Calorimetry takes advantage of an effect inherent to all interactions. Heat
released or consumed upon association is used to detect the interactions
of proteins with their ligands or other proteins. This allows direct experimental
access to the enthalpy of binding. A disadvantage is the relatively large amount of
material needed. Briefly, in this method one of the two reactants is added step by
step from a syringe to a thermally insulated cell containing the other reactant,
and the evolving heat is measured. A fit of a binding isotherm to the titration
data yields the affinity constant and the stoichiometry of the complex formed,
i.e., the number of binding sites [155]. A typical experiment is shown in Fig. 12.
From the affinity constant and the enthalpy the entropy of binding can be calculated.
In addition, measuring the enthalpy as a function of temperature yields the heat
capacity change of binding. Altogether, isothermal titration calorimetry facilitates the investigation
of biomolecular interactions in solution - without labeling or immobilization -
leading to a detailed thermodynamic characterization.
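
How the remaining thermodynamic quantities follow from the fitted parameters can be sketched in a few lines. The Python example below assumes the standard relations ΔG = -RT ln K and ΔG = ΔH - TΔS; the function names and the numerical values are invented for illustration and are not the data of Fig. 12.

```python
import math

R = 8.314462618  # molar gas constant in J/(mol*K)


def free_energy(k_assoc, temperature):
    """Standard free energy of binding, dG = -R*T*ln(K), in J/mol."""
    return -R * temperature * math.log(k_assoc)


def entropy(k_assoc, delta_h, temperature):
    """Entropy of binding, dS = (dH - dG) / T, in J/(mol*K)."""
    return (delta_h - free_energy(k_assoc, temperature)) / temperature


def heat_capacity_change(t1, delta_h1, t2, delta_h2):
    """Heat capacity change from enthalpies at two temperatures, in J/(mol*K)."""
    return (delta_h2 - delta_h1) / (t2 - t1)


if __name__ == "__main__":
    # Invented example: K = 1e6 1/M and dH = -40 kJ/mol at 25 degrees C
    k_assoc, delta_h, temp = 1.0e6, -40.0e3, 298.15
    print(f"dG = {free_energy(k_assoc, temp) / 1000:.1f} kJ/mol")
    print(f"dS = {entropy(k_assoc, delta_h, temp):.1f} J/(mol*K)")
    print(f"dCp = {heat_capacity_change(288.15, -35.0e3, 308.15, -45.0e3):.0f} J/(mol*K)")
```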



2.3 

Kinetic Analysis 



2.3.1 

Separation Techniques 

As described for equilibria above, separation techniques like filter binding
assays or pull-down methods can also be applied to separate reactants and products of
slow reactions. Here modern HPLC systems with autosampler and time-programmed
injection offer a convenient approach for the analysis of reactions on
the time scale of minutes and hours.

Fig. 12. Ras in complex with GppNHp at 600 µmol/l is injected into a solution of 45 µmol/l
RalGDS-RBD. The upper panel records the (peakwise) change in heating power
which is necessary to keep the cell at constant temperature after each injection. The integrated
peaks of the upper panel are plotted vs the molar ratio of Ras • GppNHp/RalGDS in the lower
panel. The fitted curve yields the data in the box, where N indicates the stoichiometry, K the
affinity constant and H the enthalpy of binding



2.3.2 

Stopped Flow 

If the reaction of interest is too fast to separate reaction products with classical
techniques, or even to follow the change in signal in a standard cuvette with a
UV/Vis or fluorescence spectrometer, stopped flow systems are a suitable alternative.
Here, the reaction partners are filled into two syringes, which are driven
by a pneumatic ram. The two reactants are then mixed in a small chamber,
pass through a cuvette and end up in a stop syringe. As long as the piston of
this syringe has not reached the trigger switch where it stops, there is a constant
flow and the mixture in the cuvette is exchanged continuously. After the
stop the reaction proceeds in the cuvette and can be observed by spectroscopic
(or other) readouts. Figure 13 shows the setup of a stopped flow system. Commercial
instruments have dead times of 1-2 ms and allow observation of reactions
with half-times down to the ms range.

Fig. 13. Scheme of a stopped flow instrument. A pneumatically or electrically driven ram pushes the
contents of two driving syringes into a mixing chamber (M). The solution then passes a cuvette
(cuv) and fills a stop syringe. When the piston of the stop syringe reaches the trigger switch
the flow stops. Light of a selected wavelength passes through the cuvette. Reactions can be
observed by changes in absorption (T = detector of transmitted light) or fluorescence signal
(detector F)



To circumvent complex mathematical evaluations, stopped flow experiments 
are measured preferentially under pseudo first-order conditions. The rate of 
complex formation between protein A and protein or ligand B is described by 



$$\frac{d[AB]}{dt} = k_{on} \times [A] \times [B] - k_{off} \times [AB]$$

[A] may be replaced by [A]0 - [AB], and if [B]0 » [A]0 the change in [B] may be
neglected, i.e., it may be approximated as constant, corresponding to pseudo
first-order kinetics (where [A]0 and [B]0 are the concentrations of A and B at
t = 0). Integration then leads to

$$[A]_t = [A]_0 \times \exp\left(-t \times (k_{on}[B]_0 + k_{off})\right)$$

Thus, the time traces can be fitted by an exponential equation with the observed
rate constant kobs = kon × [B]0 + koff, and the linear concentration dependence
of the observed rate constant yields kon as the slope and koff as the
ordinate intercept.

One advantage of (pseudo) first-order kinetics is that the
kinetic parameters do not depend on the active concentration of A. The only
concentration which has to be determined precisely is that of the ligand in
excess (B).
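
The evaluation described above can be sketched in Python. The example is a simplified illustration on noise-free synthetic traces: the function names and rate constants are hypothetical, concentrations are given in µmol/l, and a log-linear fit stands in for the non-linear exponential fit that would normally be used on real data.

```python
import numpy as np


def pseudo_first_order_trace(t, a0, b0, k_on, k_off):
    """[A]_t = [A]_0 * exp(-(k_on*[B]_0 + k_off) * t) for [B]_0 >> [A]_0."""
    return a0 * np.exp(-(k_on * b0 + k_off) * t)


def fit_k_obs(t, trace):
    """Observed rate constant from a log-linear fit of an exponential trace."""
    slope, _ = np.polyfit(t, np.log(trace), 1)
    return -slope


def fit_rate_constants(b0_values, k_obs_values):
    """Linear fit of k_obs vs [B]_0: slope = k_on, ordinate intercept = k_off."""
    k_on, k_off = np.polyfit(b0_values, k_obs_values, 1)
    return k_on, k_off


if __name__ == "__main__":
    k_on_true, k_off_true = 1.0, 5.0                # invented: 1/(uM*s) and 1/s
    t = np.linspace(0.0, 0.5, 200)                  # seconds
    b0_series = np.array([5.0, 10.0, 20.0, 40.0])   # uM, in excess over A (0.5 uM)
    k_obs = [fit_k_obs(t, pseudo_first_order_trace(t, 0.5, b0, k_on_true, k_off_true))
             for b0 in b0_series]
    print(fit_rate_constants(b0_series, k_obs))
```

Note that the initial concentration of A affects only the amplitude of the trace, not the fitted rate constants.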



2.3.3 

Surface Plasmon Resonance 

As described for stopped flow experiments above, all commercially available 
SPR systems work under (pseudo) first-order conditions as well. This is realized 
either by a large excess of free ligand (in the large volume of the cuvette) com- 
pared with a nanoliter volume of the sensor layer [156] or by continuous re- 
placement of free ligand in a flow injection system (e.g., BIAcore [157]). 

The signal is measured in terms of resonance units (RU) and the mathemati- 
cal treatment is as described above. An increase in RU signal is proportional to 
the concentration of complex AB. The maximum of the specific change in signal 
then corresponds to a complete saturation of the immobilized ligand. 

Rinsing the sensor surface with buffer results in an irreversible dissociation, 
because all molecules which dissociate from immobilized protein are removed 
from the system by the buffer stream, allowing one to determine the rate con- 
stant of dissociation separately. 
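
The association and dissociation phases of an idealized sensorgram can be sketched as follows. The Python example assumes a simple 1:1 binding model under pseudo first-order conditions without mass transfer limitation or rebinding. The function names and all parameter values are invented and do not describe any particular instrument.

```python
import numpy as np


def association_phase(t, conc, k_on, k_off, r_max):
    """Signal (RU) while ligand at constant concentration conc is applied."""
    k_obs = k_on * conc + k_off
    r_eq = r_max * k_on * conc / k_obs      # plateau reached at this concentration
    return r_eq * (1.0 - np.exp(-k_obs * t))


def dissociation_phase(t, r_start, k_off):
    """Signal (RU) during buffer rinse: first-order decay without rebinding."""
    return r_start * np.exp(-k_off * t)


if __name__ == "__main__":
    k_on, k_off, r_max = 1.0e5, 1.0e-2, 500.0   # invented: 1/(M*s), 1/s, RU
    t = np.linspace(0.0, 300.0, 7)              # seconds
    r_on = association_phase(t, 1.0e-7, k_on, k_off, r_max)
    r_off = dissociation_phase(t, r_on[-1], k_off)
    for ti, a, d in zip(t, r_on, r_off):
        print(f"{ti:6.0f} s  association {a:7.1f} RU  dissociation {d:7.1f} RU")
```

In this model the dissociation phase depends on the dissociation rate constant alone, which is why rinsing with buffer allows it to be determined separately.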

Again, care has to be taken regarding the non-ideal (or real) behavior of the measurement
system. Applications are limited by non-specific adsorption of molecules
on the surface, mass transfer effects (under conditions of laminar flow a
1-5-µm layer between sensor surface and bulk flow is not mixed and has
to be crossed by passive diffusion) or limited access to the immobilized molecules
[158-160].



2.4 

Structure 

Determination of three-dimensional protein structures has become an impor- 
tant tool in protein research, indicated by the exponentially growing number of 
protein data bank entries during the last decade. They have contributed con- 
siderably to our understanding of function and mechanism of biomolecules. 
Structures of proteins alone or in complex with their substrates or binding partners
like cofactors, DNA, or other proteins can visualize interactions at the atomic
level. On the one hand mechanisms elucidated by biochemical methods may
find further support or corrections. On the other hand the three-dimensional
picture often proved to be the starting point for a detailed analysis of protein 
function and mechanism, particularly by mutating amino acid residues and 
studying the effect at key positions of the protein identified in its structure. 
Pharmaceutical companies began to direct their interest towards the structures 
of potential drug targets and the term rational structure-based drug design was 
able to supersede drug screening - at least for a short while. There are two major 
techniques for structure determination - X-ray analysis and NMR spectroscopy, 
both of which have their advantages and disadvantages, with the potential to 
complement each other. 

2.4.1 

X-Ray Analysis 

For structure determination by X-ray analysis, single crystals of the protein are 
required. Depending on the X-ray source used, which can be a conventional X-ray 
rotating anode or a highly focussing synchrotron beam line, the minimum size of 
crystals may range from 0.3 mm down to 0.05 mm in diameter. The crystalliza- 
tion of the proteins is the key step towards their three-dimensional structure. 
Appropriate conditions are screened for by systematically changing the condi- 
tions in the solution like variation of temperature, pH, various salts and their con- 
centrations, concentration of the protein, and, particularly, the type and concen- 
tration of precipitants (e.g., ammonium sulfate or diverse polyethylene glycols). 

Once a suitable crystal is obtained and the X-ray diffraction data are collected,
the calculation of the electron density map from the data has to overcome a hurdle
inherent to X-ray analysis. The X-rays scattered by the electrons in the protein
crystal are defined by their amplitudes and phases, but only the amplitude can be
calculated from the intensity of the diffraction spot. Different methods have been
developed in order to obtain the phase information. Two approaches commonly
applied in protein crystallography should be mentioned here. If the structure
of a homologous protein or of a major component in a protein complex is
already known, the phases can be obtained by molecular replacement. The other
possibility requires further experimentation, since crystals and diffraction data
of heavy atom derivatives of the native crystals are also needed. Heavy atoms may
be introduced by covalent attachment to cysteine residues of the protein prior to
crystallization, by soaking of heavy metal salts into the crystal, or by incorporation
of heavy atoms in amino acids (e.g., Se-methionine) prior to bacterial synthesis
of the recombinant protein. Determination of the phases corresponding to
the strongly scattering heavy atoms allows successive determination of all phases.
This method is called isomorphous replacement.

The result is the electron density map of the protein crystal. The final task for
the crystallographer is to build the appropriate protein model, i.e., fitting amino
acid after amino acid into the electron density. Routinely the theoretical amplitudes
and phases are calculated from the model and compared to the experimental data
in order to check the correctness of model building. The positions of the protein
backbone and the amino acid side chains are well defined by X-ray structures at a
resolution around 2 Å. Many examples of protein structures are known reaching
down to a resolution of 1 Å, which is solely controlled by the quality of the crystal.

2.4.2 

NMR Spectroscopy 

Whereas structure determination by X-ray analysis is, in principle, not restricted by
the size of the molecule, there is an upper limit in NMR spectroscopy. The main
reason for this is not the increasing complexity of the spectra with the size of
the proteins but the increasing line widths of the resonance peaks. Due to a
decreased tumbling rate with growing size of the molecule in solution, the
spin-spin relaxation times become shorter, the inverse values of which define the
width of the peaks. Depending on the strength of the magnetic field in the spectrometer,
the biomolecule may be as large as 30-40 kDa. However, the newly
introduced TROSY spectroscopy, which leads to strongly reduced line widths,
holds the promise that in future NMR structure determination will be possible
even with much larger proteins [161, 162].

Two major advances have made NMR spectroscopy available for protein
structure determination: the introduction of pulsed excitation combined with
Fourier analysis and the still ongoing development of multi-dimensional NMR
techniques. These methods, also known as correlation spectroscopy, require
labeling of the protein with isotopes like ¹⁵N and ¹³C, the nuclei of which have a
spin quantum number of 1/2. This is achieved by feeding the bacteria exclusively
with ¹⁵NH₄Cl and ¹³C-glucose and a cocktail of essential salts. The assignment
of all peaks to the corresponding amino acids represents the bottleneck in protein
structure determination by NMR spectroscopy. Basically, the information
about the three-dimensional structure of the protein is extracted from NOESY
spectra. The nuclear Overhauser effect underlying this two-dimensional technique
generates cross-peaks in these spectra, each of which originates from two spins
that are coupled through space and not through covalent bonds. This
allows for identification of neighbors other than those dictated by the sequence.
The distances can also be calculated and, based on this information (as a rule of
thumb > 10 NOE signals per residue), the three-dimensional structure can be
constructed. Typically, short- and medium-range NOEs are used to build β-sheets
and α-helices, which are also identified by their specific chemical shifts,
whereas long-range NOEs are useful for arrangement of the tertiary structure.

The main advantage of NMR spectroscopy is its use with proteins in solution.
In consequence, rather than obtaining a single three-dimensional structure of
the protein, the final result for an NMR structure is a set of more or less superimposable
structures which fulfill the criteria and constraints given particularly by the
NOEs. Typically, flexible protein loops appear as widely diverging
structures in the corresponding part of the protein. Likewise, two distinct local conformations
of the protein are represented by two distinct populations of NMR structures.
Conformational dynamics are observable on different time scales. The
rates of interconversion of two (or more) substructures can be calculated from
analysis of the line shape of the resonances and from the spin relaxation times T1
and T2, respectively.

3

Analyzing Function and Properties of Ras 



3.1 

Properties of Ras 

3.1.1 

Nucleotide Binding 

Ras proteins are the prototype for small GTP-binding proteins, featuring
high selectivity and affinity for guanosine di- and triphosphate. The first
experiments to obtain information about the kinetics of nucleotide dissociation
from Ras were performed with ³H-labeled GDP or ³²P-labeled GTP in filter binding
studies [163]. Here, Ras protein was loaded with radioactive nucleotides
and incubated with an excess of non-labeled GDP for a given time. Separation of
protein-bound and free nucleotide was achieved by filtration through nitrocellulose
filters and resulted in a dissociation rate constant kdiss of 0.0079/min for
GDP and 0.023/min for GTP with wild type protein at 37 °C. The dissociation
reaction for GTP was superimposed on the intrinsic GTP hydrolysis. Therefore
kdiss(GTP) for the oncogenic RasG12V protein (0.0047/min) may reflect the dissociation
reaction more realistically.
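
To give a feeling for these numbers, first-order rate constants can be converted into half-lives via t1/2 = ln 2 / k. The short Python sketch below uses the three dissociation rate constants quoted above; the function name and dictionary labels are chosen here for illustration.

```python
import math


def half_life(rate_constant):
    """Half-life of a first-order process, t1/2 = ln(2) / k."""
    return math.log(2.0) / rate_constant


# Dissociation rate constants quoted in the text, in 1/min at 37 degrees C
k_diss = {
    "GDP, wild type": 0.0079,
    "GTP, wild type": 0.023,
    "GTP, G12V": 0.0047,
}

if __name__ == "__main__":
    for label, k in k_diss.items():
        print(f"{label}: t1/2 = {half_life(k):.0f} min")
```

The resulting half-lives are on the order of 0.5 to 2.5 hours, which explains why simple filtration is fast enough to separate bound from free nucleotide without disturbing the measurement.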

Access to kinetic data for the association was made possible by the introduction
of fluorescently labeled guanosine nucleotides. Particularly helpful was the
attachment of the methylanthraniloyl residue (mant) at the ribose, leading to an
equilibrium mixture of the mant group at the 2'- and 3'-positions [164]. Commonly
used non-hydrolyzable analogues of GTP are GppNHp and GTPγS (see
Fig. 14) [164].

The quantum yield of the mant fluorophore in mant-GDP or mant-GTP 
increases approximately by 100% when the molecule changes from the aqueous 
environment into the nucleotide binding pocket of the Ras protein. Therefore 
the kinetics of complex formation between nucleotide free Ras and the mant 
analogues of GDP or GTP could be detected easily in a stopped flow system by 
an increase in fluorescence signal. 

The association rate constant for mant-GDP with a value of 1.1 x 10®/(M x s) 
at 25 °C together with the dissociation rate constant results in an affinity in the
pmol/l range. In addition, the pseudo-first-order kinetic experiments revealed a
hyperbolic rather than a linear concentration dependence, indicating a two-step
binding mechanism. Presumably, the initial formation of a weak complex is followed
by a conformational change of the protein (induced fit) leading to tight
binding, which is rate-limiting at high concentrations. All constants indicated in
the following scheme could be obtained by stopped flow experiments, where
KD = K1 × k-2/k2:

$$\mathrm{Ras} + \mathrm{mantGDP} \overset{K_1}{\rightleftharpoons} \mathrm{Ras \cdot mantGDP} \underset{k_{-2}}{\overset{k_2}{\rightleftharpoons}} \mathrm{Ras^{*} \cdot mantGDP}$$
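
The hyperbolic concentration dependence expected for such a two-step mechanism can be illustrated with a short calculation. The Python sketch below assumes the standard rapid-equilibrium treatment, in which the first step (dissociation constant K1) equilibrates much faster than the isomerization (k2, k-2). The function names and parameter values are hypothetical and are not the constants measured for Ras.

```python
def k_obs_two_step(ligand, k1_diss, k2, k_minus2):
    """Observed rate constant, k_obs = k_-2 + k_2 * [L] / (K_1 + [L])."""
    return k_minus2 + k2 * ligand / (k1_diss + ligand)


def overall_kd(k1_diss, k2, k_minus2):
    """Overall dissociation constant, K_1 * k_-2 / (k_2 + k_-2).

    For k_2 >> k_-2 this reduces to K_1 * k_-2 / k_2.
    """
    return k1_diss * k_minus2 / (k2 + k_minus2)


if __name__ == "__main__":
    k1_diss, k2, k_minus2 = 10.0, 100.0, 0.001   # invented: uM, 1/s, 1/s
    for ligand in (1.0, 10.0, 100.0, 1000.0):    # uM
        print(f"[L] = {ligand:7.1f} uM  k_obs = "
              f"{k_obs_two_step(ligand, k1_diss, k2, k_minus2):7.2f} 1/s")
    print(f"overall K_D = {overall_kd(k1_diss, k2, k_minus2):.2e} uM")
```

At low ligand concentration k_obs rises almost linearly, whereas at high concentration it levels off near k2 + k-2, the signature of a rate-limiting conformational change.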

Fig. 14A-C. Guanosine nucleotides for biophysical characterizations of Ras proteins: A guanosine
diphosphate with a methylanthraniloyl ester (in grey) at the 3'-position of the ribose;
B guanosine-5'-[β,γ-imido]-triphosphate (GppNHp); C guanosine-5'-O-(3-thiotriphosphate)
(GTPγS)



3.1.2 

Intrinsic GTP Hydrolysis 

The Ras isoforms are rather slow GTPases. In the absence of catalytic partners 
the intrinsic rate of GTP hydrolysis is 0.01/min for H-Ras [163] and 0.02/min for 
N-Ras [165] at 37°C. Oncogenic mutations in Ras (which in most cases concern 
codons 12 for glycine or 61 for glutamine) result in an approximately tenfold 
reduction of the intrinsic GTPase rate [166-168], and complete loss of activa- 
tion by GAPs (see Sect. 3.2.1). Therefore the ratio of Ras • GTP and Ras • GDP in 
the cell becomes unbalanced in favor of the GTP state leading to permanent (and 
uncontrolled) signaling for proliferation. 

GTP hydrolysis by Ras, which is rather slow, has usually been analyzed
by single-turnover measurements. Ras is loaded with GTP by incubation of
Ras (being in the GDP form after preparation) with a large excess of GTP in the
presence of EDTA, which leads to faster dissociation of the nucleotide. After
change of buffer and removal of excess nucleotide, aliquots are taken at certain
time intervals and the nucleotide composition is analyzed with the help of filter
binding assays or HPLC. Series of experiments with mutations in the neighborhood
of the phosphates and the attacking water (see below) led to a correlation
between the resulting pKa value of the γ-phosphate (titrated by NMR) and the
rate of GTP hydrolysis [169]. The authors concluded that the γ-phosphate,
rather than any amino acid residue, is the general base which activates the water
molecule for nucleophilic attack.
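
The evaluation of a single-turnover experiment as described above can be sketched as a first-order fit of the fraction of GTP remaining in each aliquot. The Python example uses noise-free synthetic data and a hypothetical function name; the rate constant of 0.01/min is taken from the value quoted above for H-Ras only to generate the synthetic time course.

```python
import numpy as np


def fit_hydrolysis_rate(times, fraction_gtp):
    """Rate constant k from ln(fraction GTP remaining) = -k * t."""
    slope, _ = np.polyfit(times, np.log(fraction_gtp), 1)
    return -slope


if __name__ == "__main__":
    times = np.array([0.0, 15.0, 30.0, 60.0, 120.0, 240.0])   # min
    fraction_gtp = np.exp(-0.01 * times)    # synthetic aliquot analysis
    k = fit_hydrolysis_rate(times, fraction_gtp)
    print(f"k = {k:.4f} 1/min")
```

With real aliquots, the fraction would be computed from the GTP and GDP peak areas (HPLC) or from the protein-bound radioactivity (filter binding) at each time point.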



3.2 

Regulators and Effectors 



3.2.1 

GTPase Activating Proteins 

The slow intrinsic GTP hydrolysis by Ras can be accelerated by orders of magnitude
upon interaction with its GTPase activating proteins (GAPs). The cytosolic
RasGAPs and NF1 (neurofibromin) are the main factors which
ensure that cellular Ras exists predominantly in its inactive GDP-complexed state
[26]. The mechanism of GTP hydrolysis and its stimulation has been the
subject of controversy for over a decade.

The hydrolysis of mant-GTP bound to Ras can be monitored by a slight
decrease in fluorescence. Binding experiments of N-Ras with the non-hydrolyzable
GTP analogue mant-GppNHp showed a biphasic increase in fluorescence.
The slow phase had the same amplitude as the decrease observed for the
hydrolysis of Ras • mant-GTP, which led to the hypothesis that a conformational
change in the Ras protein precedes GTP hydrolysis [170] and represents the
rate-limiting step:

Ras • mant-GTP => Ras* • mant-GTP => Ras • mant-GDP + Pi

In this model the function of RasGAP was to accelerate the isomerization step
and to bring Ras into a conformation competent for GTP cleavage without further
support from GAP residues. A different analysis of the macroscopic readout
interpreted the small change in amplitude as the hydrolysis reaction or the
release of inorganic phosphate, and rejected the model of a GAP-induced conformational
change in Ras [171], but this was again disputed by supporters of
the conformational model [172].

Introduction of fluorescence anisotropy methods allowed direct measurement
of Ras binding to p120GAP and NF1, respectively. The affinities of
Ras • mant-GppNHp for the catalytic domains of the two Ras-GAPs were rather
low, with a KD around 20 µmol/l for p120GAP and around 1 µmol/l for NF1
[173]. It was demonstrated by this technique that a p120GAP mutant, which was
deficient in catalyzing the GTP hydrolysis in Ras, can still bind Ras • mant-GppNHp
(Fig. 15). This observation supports the hypothesis of direct involvement
of GAP residues in GTP hydrolysis.

The weak binding of p120GAP to Ras in the GTP state was enhanced considerably
when GTP (or GppNHp) was replaced by GDP and fluoroaluminate. In the
α subunits of heterotrimeric G-proteins, AlF4− and GDP were shown to mimic a
kind of transition state of GTP hydrolysis. For Ras this state could only be
shown in the presence of additional Ras-GAP, indicating an additional
stabilization of the transition state by p120GAP and NF1 [174].



Fig. 15. H-Ras • mant-GTP shows a transient increase in fluorescence signal when mixed with the
catalytic domain of the Ras-GAP neurofibromin (NF1). When wild-type NF1 is applied (gray
trace) a decrease in signal follows, indicating the hydrolysis of mant-GTP. An NF1 mutant
bearing an alanine at the position of the catalytic arginine residue (black trace) can only bind
to Ras • mant-GTP (increase) but cannot induce mant-GTP hydrolysis (no decrease). Figure
kindly provided by Reza Ahmadian

A further understanding of the GAP-mechanism was achieved by the argi- 
nine-finger hypothesis, where the role of a highly conserved arginine residue in 
Ras-GAPs of several species in completing the catalytic machinery of Ras was 
demonstrated by a combination of biophysical and structural (see Sect. 3.3.2) 
experiments [175]. 

In current research, setups developed for the analysis of Ras interactions are
being adapted for the screening of new therapeutics, e.g., chemicals which
overcome the lack of GAP-stimulated GTP hydrolysis in oncogenic Ras mutants [35].

3.2.2 

Exchange Factors 

Guanine nucleotide exchange factors (GEFs) are the immediate activators of
Ras. Upon interaction of a GEF with Ras the binding to the nucleotide is
weakened and its dissociation is accelerated. Owing to the large molar excess
of GTP over GDP in the cell, it is predominantly GTP which rebinds to Ras and
transforms it into its active state. The catalytic domain responsible for GEF
activity is present in different exchange factors, like Sos, RasGRF, Scd25, and
Cdc25, and this protein fragment has been used for many studies.

A detailed biochemical analysis by Cool and coworkers has revealed the
mechanism of nucleotide exchange promoted by GEF. By use of mant-labeled
guanine nucleotides and SPR they have studied the kinetics and equilibria of
nucleotide and exchange factor binding to Ras [176]. The equilibrium
dissociation constants (KD) for the binary Ras/nucleotide and Ras/exchange
factor complexes lie in the picomolar and nanomolar range, respectively. The
ternary complex may dissociate either by nucleotide release, the KD value of
which lies in the micromolar range, or by exchange factor dissociation with a
KD value in the millimolar range [177]. This means that exchange factor
binding to Ras lowers its nucleotide affinity by six orders of magnitude. The
microscopic process responsible for this drastic change was proposed to be the
conformational switch from a tight to a loose binding state, which was
identified as rate limiting for nucleotide dissociation and which is
accelerated by more than 10^5-fold in the ternary complex [177]. Notably, the
exchange factor is able to displace the nucleotide from Ras and, vice versa,
rebinding nucleotide induces dissociation of the exchange factor from the
ternary complex. In the same year the underlying structural details were
published; they are discussed below.
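
The ranges quoted above can be checked for internal consistency with a thermodynamic cycle: the product of the dissociation constants along the path "nucleotide first, then exchange factor" must equal the product along the path "exchange factor first, then nucleotide". The following is a minimal sketch of that check. The numerical values are order-of-magnitude placeholders chosen only to match the ranges given in the text (picomolar, nanomolar, micromolar, millimolar); they are not measured constants, and the function names are hypothetical.

```python
# Order-of-magnitude check of the thermodynamic cycle for Ras, nucleotide (N)
# and exchange factor (GEF). All values are illustrative placeholders that only
# reflect the ranges quoted in the text; they are not measured constants.

KD_RAS_N = 1e-12             # binary Ras/nucleotide complex, picomolar range (mol/l)
KD_RAS_GEF = 1e-9            # binary Ras/GEF complex, nanomolar range (mol/l)
KD_N_FROM_TERNARY = 1e-6     # nucleotide release from the ternary complex, micromolar
KD_GEF_FROM_TERNARY = 1e-3   # GEF release from the ternary complex, millimolar


def cycle_products():
    """Return the K_D products along both assembly paths of the ternary complex."""
    nucleotide_first = KD_RAS_N * KD_GEF_FROM_TERNARY
    gef_first = KD_RAS_GEF * KD_N_FROM_TERNARY
    return nucleotide_first, gef_first


def affinity_loss(kd_binary, kd_ternary):
    """Factor by which the second ligand weakens binding of the first."""
    return kd_ternary / kd_binary


if __name__ == "__main__":
    path_a, path_b = cycle_products()
    print(f"K_D(Ras/N)   * K_D(GEF from ternary) = {path_a:.1e} (mol/l)^2")
    print(f"K_D(Ras/GEF) * K_D(N from ternary)   = {path_b:.1e} (mol/l)^2")
    loss = affinity_loss(KD_RAS_N, KD_N_FROM_TERNARY)
    print(f"GEF lowers nucleotide affinity by a factor of about {loss:.0e}")
```

With these placeholder values both paths give the same product, and the same factor of about one million applies in both directions: the nucleotide weakens exchange factor binding as much as the exchange factor weakens nucleotide binding.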

Dominant-inhibitory mutants have been very helpful in elucidating the
biological effects of Ras in diverse cellular systems [178]. The most widely
used mutant is RasS17N, which has two biochemical features that lead to the
suppression of Ras function. This mutant fails to bind to downstream effectors
but it binds to GEFs, thereby sequestering the activator of endogenous Ras in
an unproductive complex. It is not so much the (fivefold) tighter binding to
the exchange factor as the (1000-fold) weaker binding to the nucleotide which
favors S17N over wild type Ras in binding to the exchange factor. According to
basic thermodynamic considerations this leads to a 5000-fold stronger binding
of exchange factor to S17N than to wild type Ras in the presence of
nucleotide, i.e., in the cell. The mutant D119N has an effect similar to that
of S17N, leading to suppression of endogenous Ras function. In contrast to
S17N, this mutant shows transforming activity at higher concentrations because
it has not lost the ability to bind to the effectors. At low concentrations
D119N acts as an inhibitor of wild type Ras present in the cell by
sequestering the exchange factor [179]. At high concentrations it activates
the effectors because it reaches the GTP form in a GEF-independent manner due
to its high intrinsic nucleotide dissociation rates [180].
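
The 5000-fold figure follows from multiplying the two factors (5 × 1000) once nucleotide is present in large excess over its KD. A minimal sketch of this arithmetic is given below, assuming strictly competitive binding of nucleotide and exchange factor to Ras (the weak ternary complex is neglected). The wild-type KD values and the nucleotide concentration are hypothetical placeholders; only the fivefold and 1000-fold factors come from the text.

```python
def apparent_kd_gef(kd_gef, kd_nucleotide, nucleotide_conc):
    """Apparent K_D of the exchange factor for Ras when nucleotide competes.

    Assumes mutually exclusive binding of nucleotide and exchange factor,
    i.e. the weak ternary complex is neglected.
    """
    return kd_gef * (1.0 + nucleotide_conc / kd_nucleotide)


def preference_for_mutant(gef_gain, nucleotide_loss,
                          kd_gef_wt, kd_nucleotide_wt, nucleotide_conc):
    """Ratio of apparent GEF K_D values, wild type over mutant."""
    kd_app_wt = apparent_kd_gef(kd_gef_wt, kd_nucleotide_wt, nucleotide_conc)
    kd_app_mut = apparent_kd_gef(kd_gef_wt / gef_gain,
                                 kd_nucleotide_wt * nucleotide_loss,
                                 nucleotide_conc)
    return kd_app_wt / kd_app_mut


if __name__ == "__main__":
    # Hypothetical wild-type constants and nucleotide concentration (mol/l).
    ratio = preference_for_mutant(gef_gain=5.0, nucleotide_loss=1000.0,
                                  kd_gef_wt=1e-9, kd_nucleotide_wt=1e-12,
                                  nucleotide_conc=1e-4)
    print(f"GEF prefers the mutant by a factor of about {ratio:.0f}")
```

As long as the nucleotide concentration greatly exceeds both nucleotide KD values, the ratio approaches the product of the two factors regardless of the placeholder values chosen.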

Mutants corresponding to S17N in Ras-related proteins like Rho or Ran have
likewise created dominant-inhibitory effects, despite the fact that completely
different exchange factors are involved. This may be based on the fact that
17N impairs Mg2+ ion binding and therefore generally leads to weaker
nucleotide binding, resulting in the effect described above. For Rap, which is
closely related to Ras, contradictory data about the S17N mutant have been
reported, but no efficient dominant-negative mutant could be identified so
far, demonstrating that weaker nucleotide binding does not necessarily lead to
the inhibitory effect. Despite
high structural similarity between Rap and Ras and between the respective
exchange factors C3G and Sos/Cdc25, van den Berghe et al. were able to point
out some subtle differences in the importance of distinct residues for the two
systems which might explain the different behavior [181]. Nevertheless, an
efficient dominant-negative mutant of Rap is still sought, as it would be
crucial for studying the biological effects of Rap.

3.2.3 

Multiple Effectors 

Many different biological effects are known for Ras which depend on the type 
and the state of the cell. It is therefore not surprising that so many potential Ras 
effectors were identified. The degree of activation of each pathway, triggered by 
the interaction of Ras with the respective effector, is modulated by the input of 
other signals in parallel or counteracting pathways. Obviously, synergism is the 
keyword to understand the complex processes in regulation of cellular func- 
tions. Rap may be an example that even a closely related Ras protein, possibly by 
competition, can contribute to suppression of Ras function [100]. Characteriza- 
tion of Ras effector interaction has been extensively reviewed [55, 153]; therefore 
only some aspects which have to do with Ras itself will be discussed here. 

The methods used for identification of new Ras targets, e.g., two-hybrid
screening in yeast, are of a qualitative nature and were able to demonstrate
interactions of the effectors with Ras relatives as well. Further
characterization of all possible interactions between the Ras-related proteins
on the one side and the different effectors on the other was necessary in
order to differentiate true partnerships. Many of the quantitative methods
resulting in estimates of the binding affinity require immobilization of Ras
or the effector. Another way to quantify Ras/effector interaction is
competition with GAP binding to Ras, taking the rate of GTP hydrolysis as the
readout [42]. For the equilibrium dissociation constant (KD), values are
reported in the range 5-500 nmol/l, the large variance of which is possibly
caused by different experimental conditions. Another method based on the
inhibition of guanine nucleotide dissociation (GDI assay) allowed systematic
determination of the KD values for all proteins in solution [182, 183]. A high
degree of specificity for the members of the Ras subfamily became evident with
respect to their interactions with the different effectors [184, 185]. The KD
values cover three orders of magnitude, and the values for Ras are in the
nanomolar range with some effectors and in the micromolar range with others.
Intriguingly, Rap shows both high and low affinities, but for different
effectors than Ras. For instance, Rap binds strongly to RalGDS and weakly to
Raf, whereas Ras binds tightly to Raf but weakly to RalGDS. The molecular
basis for this pronounced specificity was discovered to be residue Lys31 in
Rap and Glu31 in Ras, respectively [186]. The partner residue on the Raf side
is Lys84, which is attracted by oppositely charged Glu31 and repelled by
Lys31. In the case of RalGDS a patch of acidic residues is suggested to be the
counterpart of residue 31 in the Ras protein [187].

It is worthwhile to note that abrogation of interaction as detected in
biological assays, often reported for mutants or other modifications, does not
really mean complete loss of binding. Ras • GDP binds Raf-RBD 1000 times less
strongly than Ras in the GTP state, i.e., with a KD value still in the
micromolar range [182]. Other differences are far more subtle. This is shown
clearly by the above-mentioned partial loss of function mutants, i.e., by the
observation that certain mutations in Ras lead to the loss of binding to
distinct effectors. For example, E37G can still activate RalGDS but not Raf,
whereas T35S acts in the opposite way [36, 37]. Other examples are known,
including selective activation of PI(3)K (D38E, Y40G) [127]. When the increase
in the KD values is measured for these mutants, it turns out that the
differences between the effectors are not dramatic. Apparently a fivefold
difference between the KD values of Raf and RalGDS with Ras T35S (Herrmann,
unpublished) is sufficient to result in a plainly perceptible alteration of
the biological readout. These observations also suggest that effectors or Ras
proteins have to pass concentration thresholds, which are related to the
respective KD values, in order to provoke a biological effect. This again can
be explained by the high degree of synergism and cooperation in cellular
signal transduction. Therefore, when the sensitivity of cellular assays is
raised by heavily overexpressing one of the interaction partners, the results
should be interpreted with caution. The issue discussed here may also have
implications for drug development. Only slight inhibition of interaction can
be enough to achieve the desired effect; the main requirement for the drug
remains specific binding to one target. Modest inhibition combined with high
selectivity may result in few side effects.
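
The threshold argument and the caution about overexpression can be illustrated with a simple 1:1 binding isotherm. The sketch below assumes the effector is in excess over Ras, so that the free effector concentration equals the total; concentrations are expressed in units of the tighter KD, and all numbers other than the fivefold factor are hypothetical.

```python
def fraction_bound(effector_conc, kd):
    """Fraction of Ras occupied by effector for simple 1:1 binding.

    Assumes effector in excess over Ras, so free and total effector
    concentrations are treated as equal.
    """
    return effector_conc / (effector_conc + kd)


if __name__ == "__main__":
    kd_tight = 1.0   # reference K_D (arbitrary units)
    kd_weak = 5.0    # fivefold weaker binding, as discussed in the text
    for conc in (1.0, 100.0):   # near-K_D level vs. heavy overexpression
        tight = fraction_bound(conc, kd_tight)
        weak = fraction_bound(conc, kd_weak)
        print(f"[effector] = {conc:6.1f}: "
              f"occupancy {tight:.2f} (tight) vs {weak:.2f} (weak)")
```

Near the KD the fivefold difference separates the two occupancies clearly (one half versus one sixth), whereas at a 100-fold higher concentration both are close to saturation, which is why overexpression can mask a difference that matters at physiological levels.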

The dissociation of Ras and Raf was shown to be very fast, in contrast to the
intuitive belief that a stable complex should be long-lived [188, 189]. This
highly dynamic equilibrium of Ras/Raf interaction ensures that GAP, which does
not bind to Ras simultaneously with Raf, can turn off Ras in between. The
mechanism of binding was addressed by stopped-flow kinetics [189]. For the
rate constant of Ras/Raf association, saturation behavior was observed with
increasing concentration, indicating a two-step binding mechanism. Presumably,
formation of a weak complex is followed by a conformational change of one of
the proteins (induced fit), which becomes rate limiting at high protein
concentrations.
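
The saturation behavior expected for such a two-step mechanism can be sketched with the standard expression for a rapid pre-equilibrium followed by a slower isomerization, k_obs = k_iso · c / (K1 + c) + k_rev. This is a generic textbook model, not the fit used in [189]; the parameter names and all numerical values below are hypothetical.

```python
def observed_rate(conc, kd_encounter, k_iso, k_rev):
    """Observed rate constant for rapid pre-equilibrium plus isomerization.

    conc          concentration of the binding partner in excess (mol/l)
    kd_encounter  dissociation constant of the weak first complex (mol/l)
    k_iso         forward rate constant of the conformational change (1/s)
    k_rev         reverse rate constant of the conformational change (1/s)
    """
    return k_iso * conc / (kd_encounter + conc) + k_rev


if __name__ == "__main__":
    # Hypothetical parameters chosen only to show the shape of the curve.
    kd_encounter, k_iso, k_rev = 10e-6, 500.0, 10.0
    for conc_um in (1, 5, 10, 50, 200, 1000):
        k_obs = observed_rate(conc_um * 1e-6, kd_encounter, k_iso, k_rev)
        print(f"{conc_um:5d} umol/l -> k_obs = {k_obs:6.1f} 1/s")
```

At low concentration k_obs rises almost linearly, as for a one-step association; at high concentration it levels off at k_iso + k_rev, which is the signature of a rate-limiting step that does not depend on concentration.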



3.3 

Structure 

One may say that most structures relevant for understanding Ras regulation and
function have been solved, with some limitations. Ras is known in its active
and inactive state, but with the 20 C-terminal amino acids, including the
hydrophobic modifications, missing. For structure determination of the
regulators in complex with Ras, the catalytic fragments of the proteins were
used. The reason for this is the difficulty in obtaining large proteins in a
soluble and stable form in reasonable quantities and at high concentrations.
For the same reason, only the structures of the Ras binding domains of the
effectors in complex with Ras are known. Recently the structure of PI(3)Kγ was
solved, revealing the same topology of the RBD as found for other effectors
(see below), but the binding to Ras could only be modeled in this case [190].
The following paragraphs describe structural aspects of Ras activation and
deactivation and of the interaction with downstream targets.

3.3.1

Paradigm of GTPase 

Although the structure of Ras, complexed to the non-hydrolyzable GTP analog
GppNHp and solved by X-ray analysis [21, 191], was not the first in the
superfamily of GTP-binding proteins, being second to ribosomal EF-Tu [192],
Ras is established as the prototype for this large family. Ras represents the
minimal G-domain, which was later found in many other Ras-like proteins such
as Ran, Rab, and Rho proteins [193-196], in larger GTPases such as Gα [197]
and, most recently, in hGBP1, a member of the subfamily of large GTPases
[198]. The center of the G-domain is constituted by a six-stranded β-sheet
with all strands running in parallel except β2 at the edge of the protein,
which in part represents the effector region (see Fig. 16). With one
exception, between β2 and β3, an α-helix is inserted between each pair of
consecutive β-strands, so that the helices are positioned around the central
sheet. The structure of Rap, a member of the same subfamily, is almost
identical to that of Ras [199]. Members of other subfamilies of small GTPases
have a similar structure, each with specific features, like an additional
α-helix in Ran embracing effector-like proteins [200]. The larger relatives of
Ras also show the Ras G-domain core, with a few insertions according to their
size, like the helical GAP domain in Gα (see below) or the effector domain in
EF-Tu [201]. Even in the structure of hGBP1 from the family of large
(dynamin-related) GTPases, the Ras G-domain is found, also with a few
insertions with presumably specific functions and two large helical domains
extended at the C-terminus [198].



Fig. 16. X-ray structure of Ras in complex with GppNHp and a Mg2+ ion. Protein in ribbon
presentation and the nucleotide in ball and sticks. The secondary structure elements are
connected by short peptide loops, where loops 2 and 4 (see text) are in dark grey


The most important property, common to all Ras-like proteins, is the ability
to bind GTP and GDP and to adopt different conformations, which is the basis
of their switch function. As all the GTP-binding proteins have different
regulators and effector proteins it is not surprising that all conserved amino
acid motifs are concerned with nucleotide binding. They are almost exclusively
located in the peptide loops connecting the secondary structural elements
described above. Residues in loops 8 and 10, carrying NKx(D/T) and SAK
motifs, respectively, interact with the guanine base of the nucleotide. Asp119
in loop 8 contributes most strongly to recognition of the nucleotide due to
the formation of two specific hydrogen bonds to the guanine base. Loop 1, also
known as the P-loop and representing the GxxxxGK(S/T) motif, and loops 2 and 4
build the catalytic center in Ras by coordinating the phosphate groups with
main chain and side chain atoms. In particular, amino acids Lys16 and Gln61
from loops 1 and 4, respectively, are believed to be of immediate catalytic
importance, with a set of further residues being important for their proper
positioning.

In comparison with the structure of GDP-bound Ras, the largest structural
change is observed in loop 2, and to a lesser extent in loop 4 [22]. The
former is also known as the effector loop (followed by β2, see above), whereas
the latter carries catalytic residues like Gln61. Due to their conformational
switching between the GDP and GTP states, they are also referred to as switch
1 and 2, respectively. In the GTP state these two loops are held in position
by main chain contacts of Thr35 (invariant in all sequences) and Gly60 (from
the DxxG motif) to the γ-phosphate, respectively. After hydrolysis of GTP and
subsequent loss of the phosphate the two loops move apart, which may be
regarded as a relaxation to a lower energy conformation [202]. Apparently, GTP
forces Ras into a conformation amenable to effector binding.

The nucleotide is bound more tightly to Ras in the presence of Mg2+, and this
ion supports catalysis of GTP hydrolysis. It is bound through six oxygens in
nearly ideal octahedral coordination. One oxygen each is contributed by the β-
and γ-phosphates and by the side chains of Ser17 (see exchange factors) and
Thr35. The two apical positions are occupied by water molecules which are
stabilized by Asp33 (from the effector loop) and Asp57 (DxxG motif),
respectively. The contact to the Thr35 side chain and, obviously, to the
γ-phosphate oxygen is lost after GTP hydrolysis, but Mg2+ remains bound to
Ras • GDP. Altogether, many features have been identified in the Ras structure
which are responsible for the catalytic activity, e.g., by stabilizing
negative charges on the γ-phosphate, by conveying leaving group
characteristics to GDP and, alternatively, by positioning and activating a
water molecule for in-line attack on the γ-phosphate.

Time-resolved X-ray crystallography has brought further insight into the
mechanism of GTP hydrolysis and has confirmed former conclusions. With this
method it was possible to obtain the structure of Ras bound to GTP, rather
than to non-hydrolyzable analogs like GppNHp or GppCH2p, and, moreover, to
follow the structural changes in Ras due to GTP hydrolysis [203]. Initially,
Ras is bound to a photolabile and non-hydrolyzable compound, caged GTP, and
set up for crystallization. The crystal mounted in an X-ray beam is flashed
with UV light, which leads to cleavage of the protecting 2-nitrophenylethyl
group, leaving Ras • GTP behind. The crystal is then exposed to a white X-ray
beam, which allows data collection within a few minutes according to the Laue
method. As GTP hydrolysis in the crystal is as slow as in solution (half-life
of 40 min at room temperature), the structure of Ras • GTP and snapshots
during the time course of hydrolysis can be obtained. GppNHp and GppCH2p seem
to be suitable analogs since the structure of Ras with GTP is very similar to
those with the analogs, showing the catalytic features described above.
Comparison to the Ras • GDP structure, obtained by monochromatic X-ray
diffraction after complete hydrolysis, shows nicely the structural changes in
switch 1, here particularly residues 32-36 (the region around Thr35), and in
switch 2. Usage of pure diastereoisomers of caged GTP and a rapid freezing
technique led to higher resolution of the Ras • GTP structure [204].
Comparison of the high resolution structures of GTP- and GppNHp-bound Ras
showed that they are almost identical, implying that there is no rate-limiting
conformational change between two GTP states (one being represented by the
equilibrated GppNHp structure) prior to hydrolysis. This GTP structure also
contributed to the discussion about the importance of distinct water molecules
and catalytic residues, which shall not be further detailed here.
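
Why a few minutes of Laue data collection are short enough can be seen from first-order kinetics with the half-life of 40 min quoted above. The sketch below assumes simple single-exponential decay of Ras • GTP after photolysis; the exposure times in the loop are hypothetical examples, not values from [203].

```python
import math

HALF_LIFE_MIN = 40.0  # half-life of GTP hydrolysis in the crystal, from the text


def rate_constant(half_life_min):
    """First-order rate constant (1/min) from the half-life."""
    return math.log(2.0) / half_life_min


def fraction_gtp_remaining(t_min, half_life_min=HALF_LIFE_MIN):
    """Fraction of Ras still in the GTP state t_min minutes after photolysis."""
    return math.exp(-rate_constant(half_life_min) * t_min)


if __name__ == "__main__":
    print(f"k = {rate_constant(HALF_LIFE_MIN):.4f} 1/min")
    for t in (2.0, 5.0, 40.0, 120.0):   # hypothetical time points in minutes
        print(f"t = {t:5.1f} min: {fraction_gtp_remaining(t):.2f} of Ras in GTP state")
```

Within the first few minutes the large majority of molecules is still GTP-bound, so an early Laue data set essentially samples Ras • GTP, while later data sets sample increasing mixtures with Ras • GDP.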

Ras in complex with caged GTP was also used in time-resolved infrared
spectroscopy. After vibration band assignment with the help of isotope
labeling, the phosphate groups of Ras-bound GTP could be studied, revealing a
lower frequency for the β(PO2−) vibration compared to free GTP [205]. This was
taken as an indication of extensive interaction of the β-phosphate with the
protein environment, leading to electron withdrawal from the β/γ-bridging
oxygen, and hence to bond weakening between this oxygen and the γ-phosphorus
atom. The authors see the splitting of GTP as “preprogrammed” by the
β-phosphate interactions with Ras.



3.3.2 

Regulators and Effectors 



3.3.2.1

Exchange Factors 

Kuriyan and coworkers used a fragment of the exchange factor Sos comprising
the catalytic domain and in addition, as it turned out after structure
determination, a region which stabilizes the catalytic region [206]. Both are
purely helical, and the catalytic domain is described as a bowl in the center
of which Ras is bound. Interactions of Ras with Sos are observed for the
P-loop, switch 1 and 2, as anticipated, and for helix α3. Hydrophobic residues
from switch 2 act as a binding anchor buried in the hydrophobic core of Sos.
Most further contacts are hydrophilic in nature, opening the way to fast
displacement of Sos by competing water molecules after nucleotide binding. An
obvious catalytic feature is an α-helical hairpin protruding from Sos into the
nucleotide binding pocket of Ras, which is nucleotide-free in this structure.
It pushes switch 1 aside, causing large structural changes in this functional
element of Ras (compared to the
structure of nucleotide-bound Ras). At the same time it inserts a hydrophobic
residue, thereby blocking magnesium binding, and positions an acidic side
chain at the site normally occupied by the α-phosphate of the nucleotide.
Sos-induced changes of switch 2 are also held responsible for impaired
nucleotide binding. Considering the need for a transient competition between
exchange factor and nucleotide for binding to Ras, and in view of the
mechanism described above, the authors point out an intriguing observation in
the Ras/Sos structure. Whereas the binding sites for the phosphate part of the
nucleotide and the associated magnesium ion are disturbed by Sos, the binding
pockets for the ribose and the base of the nucleotide are not fundamentally
altered. This allows gradual rebinding of the nucleotide to Ras: the guanine
base and the ribose moiety enter first, and the phosphate groups then expel
Sos. Regardless of which of the conformational changes is the rate-limiting
event for overall nucleotide exchange, both biochemical and structural studies
have shown that more than disruption of magnesium ion binding is responsible
for efficient enhancement of nucleotide exchange rates.



3.3.2.2

GTPase Activating Proteins 

The breakthrough in crystallization of GAP in complex with Ras was the
discovery that GDP and AlF4−, together, are bound by Ras in the presence of
stoichiometric amounts of GAP, thereby mimicking a transition state of GTP
hydrolysis [174]. The historic background is that Gα, bound to GDP, was known
to be activated by AlF4−, which occupies the position and imitates the role of
the γ-phosphate of GTP. The reason for this is that Gα carries a helical
domain, inserted into its canonical Ras-like G-domain, which is responsible
for the higher GTPase activity in comparison to Ras by contributing additional
catalytic amino acid side chains. These residues are interpreted to stabilize
the transition state of GTP hydrolysis, as evidenced in the three-dimensional
structure [207]. Ras • GDP alone, in contrast, is not able to bind AlF4−
except in complex with GAP, which brings the helical domain with the
stabilizing residues into play. Rho in complex with Rho-GAP could also be
crystallized by use of GDP and AlF4− as GTP or transition state analog [208].

At first glance the structure of the catalytic GAP fragment resembles that of
the exchange factor as it is purely helical, forming a “shallow groove” [209]
rather than an “oblong bowl” [206]. Ras seems to dive into this groove/bowl in
the same orientation, and the contacts are again described to be via the
P-loop, switch 1 and 2, and helix α3. A closer look of course reveals many
important differences, explaining many biochemical findings and the high
degree of conservation of distinct residues in the family of GAP proteins.
Most interesting is a loop exposed from the GAP fragment towards the
nucleotide binding site of Ras, allowing an arginine to penetrate into the
phosphate region. This “arginine finger”, hypothesized from biochemical
experiments, is interpreted as the GAP residue which contributes catalytic
assistance to GTP hydrolysis on Ras. It appears that a negative charge
developing on the γ-phosphate during GTP hydrolysis is stabilized by the
arginine side chain, leading to faster hydrolysis.
This is taken as further evidence for an associative (SN2-like) mechanism of
GTP hydrolysis. More details about this debate are reviewed in [210].

A further catalytic effect of GAP binding is the stabilization of switch 2,
thereby fixing Ras residue Gln61 in a catalysis-competent position close to
the attacking water molecule. For Gln61, one of the two most frequently
mutated amino acids found in oncogenic Ras, the involvement in GTP hydrolysis
was apparent from the Ras structure alone. Yet its exact role in the mechanism
is not clear. The oncogenic effect of mutated Gly12 could now also be
rationalized by the Ras/GAP structure. Any amino acid side chain at this
position leads to a clash with the main chain of the arginine finger,
presumably leading to deranged conformations in the catalytic center of the
complex.

It seems difficult to derive strategies for the development of anti-cancer
drugs from the Ras/Sos structure because oncogenic Ras can fully dispense with
exchange factors. In contrast, the Ras/GAP structure may help to create new
ideas on how the hydrolysis activity of mutated Ras might be restored by
addition of a GAC (= GTPase activating compound). A first approach is provided
by the investigation of the mechanism of DABA-GTP hydrolysis [35]. See also
[235] for a review.



3.3.2.3

Ras Binding Domains of Effectors 

In one of the first papers identifying Raf as a Ras effector, the Ras binding
domain (RBD) was defined as well [41]. By comparison of Ras-binding fragments
of Raf obtained in a two-hybrid screen, the minimal binding region was mapped
to residues 51-131 in c-Raf-1. Two years later the NMR structure of this
fragment in solution and the X-ray structure in complex with the Ras homolog
Rap were solved [199, 211]. The Raf-RBD shows the ubiquitin superfold
consisting of a five-stranded β-sheet, a three-turn, and a one-turn α-helix
(see Fig. 17). The interaction with the Ras protein is mediated by residues of
the N-terminal β-hairpin, thereby forming an apparent antiparallel β-sheet
with β2 and β3 from the Ras side. In addition, Raf residues from the
C-terminal end of the α-helix contribute further contacts to the effector
region of Ras. Apart from a small hydrophobic patch, the interaction is
mediated predominantly by oppositely charged or hydrophilic side chains
forming specific salt bridges and hydrogen bonds, respectively. The different
specificities of Ras and Rap with respect to Raf binding, which are caused by
residue 31 in each protein as described above, can be understood from the
structures. Whereas Raf residue Lys84 is pushed out by Lys31 in Rap, it forms
a stable salt bridge with Glu31 and Asp33 in the Ras-like complex [186]. NMR
experiments in solution suggest that the salt bridge partner of Lys84 is Asp33
rather than Glu31 [212].

Two and four years later, respectively, the structures of the RBDs of RalGDS
and PI(3)K were published, the former as NMR and X-ray structures [213, 214]
and the latter as part of full-length PI(3)Kγ [190]. Intriguingly, all RBD
structures of effectors known so far show the same topology despite their lack
of sequence homology. The mode of docking to the Ras protein is similar as
well, as described above for Raf-RBD, observed in the Ras/RalGDS complex
[187, 215], and modeled for Ras/PI(3)Kγ [190]. When uncomplexed RBDs are
compared with RBDs in complex with Rap or Ras, no drastic structural changes
are observed, which suggests that Ras binding does not induce significant
structural changes in the RBD and thus does not favor an allosteric model of
effector activation.



Fig. 17. The X-ray structure of Rap, highly homologous to Ras, in complex with Raf-RBD. The
structure of Rap is almost identical to that of Ras (see Fig. 16; Rap is shown here in a
different orientation). Raf-RBD is shown on the left

Ras, at the interface to the RBD, exposes predominantly negatively charged
amino acid residues. Accordingly, arginines and lysines are found on the RBD
side of Raf and RalGDS, each giving rise to specific recognition of Ras and
effector proteins. Despite all these structural similarities, an important
difference in the thermodynamics is observed for Raf and RalGDS interaction
with Ras. Whereas Raf binding is favored by a positive entropy change, RalGDS
binding is accompanied by a large decrease in enthalpy, which compensates for
a negative entropy change (Herrmann, unpublished). In order to design drugs
based on the structure of their protein targets it would be desirable to
understand the quantitative relation between structure and energy of
interaction, but as yet we are still far from this.



3.3.3 

NMR Analysis 

The NMR solution structure of Ras • GDP [216] is very similar to the X-ray
structure [22], except for the position of helix 2, which matches neither the
corresponding helix in the GDP nor that in the GppNHp X-ray structure. The
authors argue that such “energetically finely balanced conformational
states” detectable in a solution structure may be masked by crystal packing
forces [216]. Nevertheless, the same regions, namely loops 2 and 4, are found
to be mobile, which demonstrates that ill-defined electron density in the
X-ray structure is not due to crystallization artifacts but corresponds to
loop dynamics, at least in this case. From measurements of the heteronuclear
NOEs and of T1 and T2 in 15N-labeled Ras, the dynamics are calculated to lie
in the nanosecond time range.

Another type of NMR experiment revealed equilibration of two or more sta- 
ble conformations of the Ras protein. ^'P-NMR spectra of GppNHp bound to Ras 
recorded at 5°C show the resonances of the a and jS phosphorus atoms to be split 
into two lines which coalesce at higher temperatures (30°C) [217]. The two dif- 
ferent chemical shift values observed were each interpreted to result from the 
positioning of Tyr32, which may be oriented towards the phosphates in one con- 
formation and pointing away in the other. The dynamics of this flip were cal- 
culated by line shape analysis to be in the ms time range. Consistent with the 
implied mobility of the effector loop, one of these two states could be stabilized 
by the addition of effector protein. One of the two states is preferred by the inter- 
acting effector, resulting in only one resonance line for each phosphorus atom. 
Similar conclusions were made after analyzing line widths of fully isotope 
labeled Ras [218]. Extreme broadening was observed for residues of loop 2 and 
4 (switch 1 and 2), and part of loop 1, taken as indication for “regional polyste- 
rism” of the Ras molecule in the GTP-bound form. The authors discuss the pos- 
sibility that Ras, with many different interaction partners which bind to loop 2 
and 4, presents different conformations one of which is selected by a distinct 
partner. Again, binding of Raf-RBD resulted in fixation of one of the polysteric 
forms. 
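
The millisecond time scale obtained from the line shape analysis can be made plausible with the textbook coalescence estimate for two equally populated, uncoupled sites, k_c = π·Δν/√2. The sketch below is illustrative only: the peak separation of 100 Hz is an assumed value, not one reported in [217], and the equal-population assumption need not hold for Ras.

```python
import math


def coalescence_rate(delta_nu_hz: float) -> float:
    """Exchange rate constant (s^-1) at coalescence for two equally
    populated, uncoupled sites separated by delta_nu_hz (Hz)."""
    if delta_nu_hz <= 0:
        raise ValueError("peak separation must be positive")
    return math.pi * delta_nu_hz / math.sqrt(2.0)


def lifetime_ms(rate_per_s: float) -> float:
    """Mean lifetime of a state (ms) for a first-order rate constant."""
    return 1000.0 / rate_per_s


# Assumed separation of the two 31P lines; hypothetical, not from the source.
ASSUMED_DELTA_NU_HZ = 100.0

k_c = coalescence_rate(ASSUMED_DELTA_NU_HZ)
print(f"k at coalescence: {k_c:.0f} s^-1, lifetime: {lifetime_ms(k_c):.1f} ms")
```

For line separations of tens to hundreds of Hz, this estimate gives lifetimes of the order of milliseconds to tens of milliseconds, in line with the time range quoted above.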

The solution structures of Raf-, Rlf-, and RalGDS-RBD were solved and, in 
addition, NMR spectroscopy was used to probe the interaction with Ras [211, 
213, 219]. Only the RBD was labeled with ¹⁵N and the shifting of distinct cross 
peaks (HSQC spectra) due to binding of Ras allowed one to identify the binding 
surface on the RBD. Consistent with X-ray structures of the complexes (see 
above), in both Raf and RalGDS, amino acid residues located in β1 and β2 and 
the C-terminal part of α1 are involved in the interaction with Ras. 



3.4 

Posttranslational Modification 

Ras proteins and almost all other GTP binding proteins involved in signal trans- 
duction, regulation, and intracellular transport are anchored in membranes via 
hydrophobic modifications such as isoprenylation and palmitoylation [220]. Ha- 
and N-Ras as well as K-Ras4A undergo four post-translational modification 
steps starting with the prenylation of the last C-terminal cysteine in the CaaX 
box of the polypeptide chain by a protein farnesyltransferase. Subsequent remo- 
val of the last three C-terminal amino acids by a prenyl protein specific endo- 
protease and donation of a methyl group to the S-farnesylated cysteine by a 
prenyl protein specific methyltransferase occur in the endoplasmic reticulum. 
Modification is completed by palmitoylation of one or two cysteines by a puta- 
tive prenyl protein specific palmitoyltransferase presumably localized at the 
plasma membrane. K-Ras4B undergoes the same modifications as the other Ras 
isoforms, but is not palmitoylated and achieves its membrane association by a 
combination of hydrophobic (farnesyl group and carboxymethylation) and 
electrostatic interaction of its basic C-terminus with the negatively charged 
plasma membrane [221]. All interactions between Ras proteins and their effec- 
tors or regulators have to occur at the inner side of the plasma membrane 
(Fig. 18). If the membrane localization of Ras is prevented, e.g., by inhibitors of 
the farnesylation reaction, even oncogenic Ras mutants cannot transform a cell 
- an idea which is being intensively investigated for cancer treatment [222]. 

Fig. 18. Interaction between Ras and its regulators and effectors at the plasma membrane. Bind- 
ing of a growth hormone (GH) at the extracellular binding site of its receptor (RTK, receptor 
tyrosine kinase) leads to autophosphorylation of intracellular tyrosines of the RTK, which 
allows binding of the cytosolic adapter protein Grb2 and thereby brings the Ras guanine 
nucleotide exchange factor Sos to the plasma membrane. Sos now catalyzes the exchange of GDP com- 
plexed with Ras by GTP. Activated Ras • GTP then recruits Raf-kinase to the plasma 
membrane, where Raf initiates a cascade of phosphorylation reactions leading to cell division 
or differentiation. GTPase activating proteins like RasGAP stimulate the intrinsic GTP-hydro- 
lysis in Ras • GTP up to 10^-fold and bring Ras back to its resting state. While Ha-Ras is in- 
serted in the plasma membrane with its farnesyl group and up to two palmitoylthioesters at 
its C-terminus (shown on the right), the isoprenoid membrane anchoring of K-Ras4B is sup- 
ported by electrostatic interaction between a positively charged poly-lysine stretch and the 
negative head groups at the inner side of the plasma membrane (shown on the left) 

3.4.1 

In Vitro vs In Vivo 

All steps of the Ras pathway from ligand binding to receptor tyrosine kinases, 
down to activation of effectors like Raf kinase, occur at the plasma membrane. 
However, most biophysical studies on protein/protein interactions involved in 
this scenario have been carried out with bacterially synthesized proteins lacking 
C-terminal modification. This was due to major difficulties in the synthesis and 
isolation of completely modified proteins [223] and the ready availability of 
non-modified proteins expressed in E. coli. 

Of course, insertion of proteins into intracellular membranes may have effects 
upon the kinetic and thermodynamic properties of the corresponding biologi- 
cal interactions. Direct contributions of hydrophobic modifications of the Ras 
proteins to their interaction with regulators and effectors have been demonstrat- 
ed or are the subject of discussion. Catalytic activity of the exchange factor hSos 
(human Sos) with H-Ras is stimulated by the isoprenylation of Ras [224] and a 
similar effect is described for smgGDS (small guanosine dissociation stimula- 
tors) and K-Ras [225]. The G-protein responsive phosphoinositide 3-kinase 
p110γ shows markedly increased binding of Ras when Ha-Ras or K-Ras4B are 
farnesylated [226]. Additionally, the membrane itself can contribute to further 
modifications of the protein-protein interactions. It can provide additional elec- 
trostatic and hydrophobic interactions distinct from the lipid anchorage and 
thereby affect conformation and/or activity of membrane associated proteins. 

3.4.2 

Artificial Membranes 

A first approach to studying the interaction of posttranslationally modified Ras pro- 
teins with membranes was the analysis of binding and exchange of isoprenyl- 
ated peptides with and between lipid vesicles utilizing a fluorescent bimanyl 
label. Studies with K-Ras peptides revealed that a single isoprenyl group is suf- 
ficient for membrane association only if supported by carboxymethylation of 
the C-terminal cysteine [227,228]. 

Stable membrane binding for K-Ras peptides was achieved when hydrophob- 
ic and electrostatic contributions act in concert. The effect of a combination of 
positively charged side groups and an isoprenoid modification upon membrane 
binding of the C-terminus of K-Ras4B was synergistic [229]. Due to the long 
stretch of basic amino acids the electrostatic interaction of the K-Ras4B peptide 
with negatively charged vesicles results in an approximately 10^-fold increase in 
binding compared with a neutral membrane. 

Further investigations with bimanyl-labeled K-Ras4B peptides demonstrat- 
ed that relatively small differences in membrane charging (approximately 
10 mol %) are sufficient for an electrostatic enrichment in the more negative 
environment [230]. With the farnesyl group as a hydrophobic anchor, the pep- 
tide is still mobile and can swap between vesicles but may find its target mem- 
brane with the sensitive surface potential-sensing function of its lysine residues. 

The H- and N-isoforms of Ras support the first (isoprenoid) hydrophobic 
modification by additional thioester formation with palmitic acid [18]. At 
physiological temperature (37 °C) the dissociation of doubly modified lipo- 
peptides with an isoprenyl thioether and a palmitoyl thioester is very slow and 
characterized by half-times in the order of 50 h. Here, the relative effect of the 
carboxymethylation is significantly reduced. Palmitoyl groups with their C16 
alkane chain length contribute more efficiently to membrane anchoring than 
the farnesyl modification. 

These findings led to the conclusion that the regulation of membrane 
anchored proteins has to be achieved by mechanisms other than spontaneous 
dissociation. In principle, binding to an “escort protein” or de-S-acylation may 
induce dissociation of the lipoproteins out of the membrane structure. 
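
Why spontaneous dissociation is too slow to act as a regulatory mechanism becomes clear when the half-time of about 50 h is converted into a first-order rate constant. The following sketch assumes simple single-exponential (first-order) dissociation, which is an idealization not stated in the source; the function names are illustrative.

```python
import math


def rate_constant_from_half_time(half_time_h: float) -> float:
    """First-order rate constant (s^-1) from a half-time given in hours."""
    if half_time_h <= 0:
        raise ValueError("half-time must be positive")
    return math.log(2.0) / (half_time_h * 3600.0)


def fraction_bound(half_time_h: float, elapsed_h: float) -> float:
    """Fraction of lipopeptide still membrane-bound after elapsed_h hours,
    assuming single-exponential dissociation."""
    return 0.5 ** (elapsed_h / half_time_h)


HALF_TIME_H = 50.0  # value quoted in the text for doubly modified lipopeptides

k_off = rate_constant_from_half_time(HALF_TIME_H)
print(f"k_off = {k_off:.2e} s^-1")
print(f"fraction still bound after 1 h: {fraction_bound(HALF_TIME_H, 1.0):.3f}")
```

Under this assumption less than 2% of the doubly lipidated peptide leaves the membrane within one hour.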

In addition to fluorescence-based assays, artificial membranes on the surface of 
biosensors offered new tools for the study of lipopeptides. In a commercial 
BIAcore system [231] a hydrophobic SPR sensor with an alkane thiol surface was 
incubated with vesicles of defined size distribution, generating a hybrid mem- 
brane by fusion of the lipid vesicles with the alkane thiol layer [232]. If the 
vesicles contain biotinylated lipopeptides their membrane anchoring can be 
analyzed by incubation with streptavidin. Accordingly, experiments with lipo- 
peptides representing the C-terminal sequence of N-Ras show clear differences 
between singly and doubly hydrophobically modified peptides in their ability to 
persist in the lipid layer [233]. 

New insights into the analysis of hydrophobically post-translationally modified 
proteins could be achieved by the construction of lipidated proteins in a combi- 
nation of bioorganic synthesis of activated lipopeptides and bacterial expres- 
sion of the protein backbone (Fig. 19). The physico-chemical properties of such 
artificial lipoproteins differ substantially from those of the corresponding lipo- 
peptides. The pronounced dominance of the hydrophilic protein moiety (e.g., 
for the Ras protein 181 amino acids) over a short lipopeptide with one or two 
hydrophobic modifications provides solubility up to 10“^ mol/1, while the bio- 
tinylated or fluorescence labeled lipopeptides exhibit low solubility in aqueous 
solutions and can be applied in the biophysical experiments only in vesicle inte- 
grated form or dissolved in organic solvent. 

Thus lipoproteins could be injected over the surface of a lipid covered SPR 
sensor in a detergent free buffer solution, and showed spontaneous insertion 
into the artificial membrane [234]. Again, two hydrophobic modifications are 




Fig. 19. Lipopeptides with the amino acid sequence of the Ras C-terminus and the natural or 
artificial lipid modifications can be coupled with C-terminally truncated Ras via a maleimi- 
docaproyl linker. This electrophile reacts with free thiol groups (here, a C-terminal cysteine at 
the Ras moiety) 

Fig. 20. The surface of a hydrophobic SPR sensor (covered with long chain alkanethiols) was 
extended to an artificial membrane by application of lipid vesicles with a defined size distri- 
bution. Application of a Ras-lipopeptide construct with both a farnesyl and a palmitoyl 
modification leads to an increase in resonance signal assumed to indicate membrane insertion 
(grey trace). Washing with buffer induces a slow decrease in signal. A Ras protein without 
hydrophobic modification (black trace) does not lead to a signal increase 



necessary for stable insertion into the lipid layer, whereas lipoproteins with a 
farnesyl group only dissociate significantly faster out of the membrane (Fig. 20). 
Therefore the isoprenylation of a protein is sufficient to allow interaction with 
membranous structures, while trapping of the molecule at a particular location 
requires a second hydrophobic anchor [19]. 

Interaction between the Ras protein and its effector Raf-kinase depends on 
complex formation of Ras with GTP (instead of the Ras • GDP complex, present 
in the resting cell). 

The SPR setup can now be applied to the study of interactions between mem- 
brane associated proteins and their effectors and regulators in a membrane 
environment mimicking the situation in the living cell. 



3.5 

Outlook 

Ras and its relatives are subjects of intensive investigations by biological, bio- 
chemical, biophysical, and medical studies. Within just one decade more than 
17,000 articles (Medline, 1966-2000) deal with function and properties of this 
protein. Structural and functional data, based on Ras as a prototype, have pro- 
vided insight into the basic principles of GTP-binding proteins, their activa- 
tion, de-activation, and signal transmission. 

Here biophysical methods contributed a reductionistic approach. By analy- 
zing a limited number of actors under well defined conditions, the mechanisms 
of nucleotide exchange, intrinsic and stimulated GTP-hydrolysis, effector bind- 
ing, and membrane attachment have been elaborated to present a comprehen- 
sive model. 

Despite the data reviewed here, the work is far from finished. Physico- 
chemical data have to be correlated with biological activity and the complexity 
of the living cell has to be reflected in the biophysical setups. This covers the use 
of full-length proteins instead of only functional domains, processed proteins 
instead of the straightforward bacterial expression, and the introduction of a 
membranous environment vs simple experiments in solution. 

Novel techniques such as single molecule detection will focus on molecular lo- 
calization and dynamics and may open insights into the micro-compartmen- 
talization of cellular systems. 

In future, two developments may be expected. On the one hand, elaborated 
biophysical methods will measure interactions in the cellular context and diss- 
ect complex biological patterns. On the other hand, established physico-chemi- 
cal techniques mainly based on fluorescence will be utilized in high throughput 
screens to investigate new substances upon their interference with Ras function. 

For general literature on biophysical analysis, see [236-241]. 

The protein Ras, an important intracellular signal transducer, is crucially involved in the devel- 
opment of tumor growth. The farnesylation of Ras, catalyzed by the enzyme Ras-farnesyl- 
transferase, is essential to its proper functioning in the normal and in the transformed state. 
Therefore, the inhibition of Ras lipidation has become a promising target for the development 
of new classes of anti-tumor agents. This review focuses on the different classes of Ras-farnesyl- 
transferase inhibitors and compares their biological properties and modes of action in vitro 
as well as in vivo. 

1 

Introduction 

The protein Ras plays a key role in the cellular transduction of mitogenic signals 
given by external growth factors [1]. It is a central molecular switch that regulates 
cell growth, cell division, and differentiation. Misregulation of the Ras pathway 
can lead to transformation of the cell. In fact, point mutations in the correspond- 
ing Ras genes are found in approximately 30% of all human tumors, particularly 
in over 90% of human pancreatic carcinomas and 50% of human colon cancers 
[2]. The crucial role of the Ras proteins in maintaining the regular life cycle 
of cells and the extensive involvement of mutated Ras in the development of 
numerous cancers has sparked the idea that influencing the function of Ras 
might open up new opportunities for the development of alternative anti-tumor 
drugs [3,4]. Disruption of the aberrant growth signals of oncogenic Ras proteins 
provides an example of what is now recognized as ‘signal transduction therapy’ 
[2a]. Ras proteins must be localized to the plasma membrane in order to per- 
form their biological function in both the normal and the transformed state. 
Covalently attached lipids are the major driving force for membrane localiza- 
tion. Non-lipidated Ras is cytosolic and biologically inactive. Several post-trans- 
lational modifications are required at the carboxyl-terminal end of Ras before 
the initially biosynthesized precursor protein matures into a biologically active 
protein localized to the plasma membrane. Interfering with these processing 
steps might disrupt the aberrant growth signals of oncogenic Ras proteins. 

2 

Biochemical Lipid-Modification of Ras 

The Ras proteins are synthesized as biologically inactive, cytosolic precursor 
proteins. They are then modified by several post-translational processing steps 
at the carboxyl terminal end and thereby converted into biologically active 
proteins localized at the plasma membrane. The cysteine of the C-terminal 
CAAX sequence (C is cysteine, A is generally an aliphatic amino acid, and X is 
methionine, serine, alanine, or glutamine) is first enzymatically S-farnesylated; 
the AAX part is then cleaved off by a specific protease, and the free C-terminal 
cysteine is finally converted into a methyl ester (Scheme 1). 

In H- and N-Ras, additional cysteine residues in the immediate vicinity of the 
CAAX moiety are lipidated by formation of palmitic acid thioesters. Anchoring 
of the Ras proteins to the plasma membrane is mediated by the lipid groups. 
Genetic experiments have shown that farnesylation is essential to Ras function 
in the normal and the transformed state [5]. Palmitoylation appears to be 
not absolutely required for Ras cell-transforming activity but is important for 
conferring full biological activity on Ras [6]. The K4B-Ras protein compensates 
for the absence of a hydrophobic palmitoyl moiety with a positively-charged 
lysine-rich region near the CAAX box to enhance membrane interactions with 
negatively charged phospholipids. 

The finding that lipid-modification of Ras, in particular farnesylation, is 
crucial to its biological activity led to the idea that inhibiting the enzymes re- 
sponsible for the modification would in turn inhibit Ras function and provide 
new approaches for developing potent anti-tumor drugs. 

Scheme 1. Posttranslational modification of Ras protein with lipid groups (FPP: Farnesyl- 
pyrophosphate, PalCoA: Palmitoyl CoA). The unprocessed Ras protein (CAAX, β-turn) is 
farnesylated by farnesyltransferase (FTase) with FPP and then converted into the fully 
processed Ras protein by (1) protease, - Val-Leu-Ser; (2) methyl transferase; (3) palmitoyl 
transferase, PalCoA 

3 

Structure of Ras-Farnesyltransferase and Possible Opportunities 
for Its Inhibition 

Farnesyltransferase is a heterodimeric protein composed of an α- and a β-sub- 
unit, and zinc and magnesium ions are required for its activity [7, 8]. A closely 
related heterodimeric enzyme is geranylgeranyltransferase I (GGTase I). It also 
recognizes proteins with a CAAX-box when X is leucine. Both enzymes share the 
same α-subunit and the requirement for Zn²⁺ and Mg²⁺ but the β-subunits are 
different. The similarity of these enzymes highlights the importance of selec- 
tivity of farnesyltransferase (FTase) inhibitors. 

Since the discovery that the Ras proteins are farnesylated, numerous inhi- 
bitors of FTase have been developed. This field has been reviewed in detail and 
only a brief overview is given here [3, 4, 9, 10-12]. Initial studies on inhibitor 
development relied on information about substrate specificity as well as struc- 
tural studies of the substrate in solution and bound to the enzyme (obtained 
from NMR spectroscopic investigations) [13, 14]. The crystal structure of the 
free enzyme was reported at 2.25 Å resolution [15]. The active site consists of 
two clefts which are at the junction of a bound zinc ion. The zinc coordinates the 
thiol of cysteine in a ternary complex. One of the clefts was found to be a hydro- 
philic surface groove near the subunit interface. It could correspond to the site 
of binding of the Ras protein. The second cleft is a region of aromatic residues. 
It is of a size that would accommodate the farnesyl moiety but not the longer 
geranylgeranyl pyrophosphate. In addition, the structure of the rat enzyme 
in complex with farnesyl-pyrophosphate was solved at 3.41 Å resolution [16]. 
Recently, the analysis of four ternary complexes of rat FTase co-crystallized 
with FPP analogs and K4B-Ras peptides was reported [17]. According to these 
studies based on crystal structures at 2 Å resolution, zinc is essential for 
productive peptide binding, whereby the conformation of the zinc bound ligand 
seems not to be a β-turn (see Scheme 1), in contrast to the results deduced 
from NMR-investigations [13, 14]. The three-dimensional information obtained 
from the X-ray structures should now spur the development of more efficient 
inhibitors. 

4 

Inhibitors of Ras-Farnesyltransferase 



4.1 

Inhibitors Based on the CAAX Motif 

Based on the CAAX motif, peptide analogues were designed in which peptide 
amide bonds are replaced by amine and ether groups [18]. In particular, β-turn 
mimetic 1 (Fig. 1) inhibits the FTase in vitro with an IC50 value of 1.8 nmol/l and 
shows highly specific activity in comparison to inhibition of GGTase I. 

Fig. 1. FTase inhibitors in which amide bonds were replaced by isosteric amines and ethers, 
and which incorporate non-natural amino acids (1: L-739,749, R = H; 2: L-739,750, R = CH3; 
3: B 956, R = H; 4: B 1086, R = CH3; 5; 6) 



The less polar methyl ester 2 as prodrug showed better results in vivo and 
inhibits both farnesylation of the Ras protein and growth of Ras-transformed 
cells, whilst proliferation of Raf- or Mos-transformed cells was not influenced. 
Growth of human pancreatic adenocarcinoma cells with mutated K-Ras, c-Myc 
and p53 genes was inhibited by application of 2. If the compound is administer- 
ed over a period of 5 days to mice with implanted Ras-dependent tumors, tumor 
growth can be reduced by up to 66% compared to untreated mice, whereas 
application of the antitumor antibiotic doxorubicin only resulted in 33% reduc- 
tion under the same conditions. It is particularly noteworthy that treatment with 
the β-turn mimetic - in contrast to treatment with doxorubicin - was without 
any visible side effects, such as weight loss. 

However, 2 also affected the regulation of actin stress fiber formation [19]. 
Rho proteins are involved in the regulation of various cytoskeletal structures, 
and RhoB is believed to be one of the prime targets of FTase inhibitors. RhoB 
is apparently both geranylgeranylated and farnesylated [20, 21]. If cells were 
treated with 2, vesicular localization of RhoB was inhibited. Thus 2 may also 
inhibit the farnesylation of RhoB, thereby interfering with actin stress fiber 
formation [22]. 

Further modifications of the CAAX tetrapeptide structure led to inhibitor 3 
which blocked H-Ras farnesylation with an IC50 of 11 nmol/l [23]. Tumor cell 
lines expressing mutant H- and N-Ras were most sensitive to this com- 
pound, which inhibits tumor growth of EJ-1 human bladder carcinomas by about 
60% at a dose of 100 mg/kg. The in vivo results and particularly tumor regres- 
sion induced by the CAAX mimetics described thus clearly demonstrate the 
efficacy of FTase inhibitors as antitumor agents. 

High activity and selectivity were also achieved by replacing the phenylalanine 
residue in Cys-Val-Phe-Met with 1,2,3,4-tetrahydroisoquinoline-3-carboxylate 
(Tic) and modifying the peptide backbone to give 5 (Fig. 1) [24]. This compound 
displayed an IC50 of 2.8 nmol/l against rat FTase and was 500-fold less potent 
against GGTase I. In compound 6 the terminal cysteine residue was replaced by 
4-imidazole leading to an increase in activity to IC50 = 0.79 nmol/l [25]. 
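
The practical meaning of a 500-fold selectivity can be illustrated with a simple dose-response calculation. The sketch below assumes a hyperbolic inhibition curve with a Hill coefficient of 1, i.e. fractional inhibition = [I]/([I] + IC50); this model and the chosen inhibitor concentration are assumptions for illustration, and only the IC50 of 2.8 nmol/l and the 500-fold factor are taken from the text.

```python
def fractional_inhibition(conc_nm: float, ic50_nm: float) -> float:
    """Fraction of enzyme activity inhibited at inhibitor concentration
    conc_nm, assuming a hyperbolic curve with Hill coefficient 1."""
    if conc_nm < 0 or ic50_nm <= 0:
        raise ValueError("concentration must be >= 0 and IC50 > 0")
    return conc_nm / (conc_nm + ic50_nm)


IC50_FTASE_NM = 2.8      # compound 5 against rat FTase (from the text)
SELECTIVITY = 500.0      # fold lower potency against GGTase I (from the text)
IC50_GGTASE_NM = IC50_FTASE_NM * SELECTIVITY

# Assumed test concentration: ten times the FTase IC50 (hypothetical choice).
conc_nm = 10.0 * IC50_FTASE_NM

ftase = fractional_inhibition(conc_nm, IC50_FTASE_NM)
ggtase = fractional_inhibition(conc_nm, IC50_GGTASE_NM)
print(f"FTase inhibited: {ftase:.1%}, GGTase I inhibited: {ggtase:.1%}")
```

At ten times the FTase IC50, this simple model predicts about 91% inhibition of FTase but only about 2% inhibition of GGTase I.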

Replacement of the two aliphatic amino acids in the CAAX motif was achiev- 
ed with benzodiazepines such as 7 and 8 (Fig. 2). 

The central unit of these peptidomimetics imitates a β-turn and brings 
the NH2-terminus of the cysteine analogue and the COOH terminus of the 
methionine into spatial proximity; these can then complex the Zn²⁺ ion which is 
essential for activity of the FTase [26]. The free acid 7 inhibits the enzyme with 
an IC50 value of 1 nmol/l, whilst in intact cells the methyl ester 8, despite its 
weaker in vitro activity, is significantly more potent because it can penetrate the 
plasma membrane better due to its lower polarity. This property can be used to 
convert the morphology of H-Ras-transformed cells back to the normal form 
and to inhibit growth of these cells, whereas the substance shows no effect on 
Src-transformed and untransformed rat fibroblasts. The inhibitor therefore acts 
selectively on transformed cells and does not influence growth of normal cells. 
This result is noteworthy because farnesylation of the wild type H-Ras protein 
is inhibited by the benzodiazepine and H-Ras is involved in growth of normal 
fibroblasts. Detailed examination of this unexpected result shows that inhibitor 
8 reduces the amount of farnesylated H-Ras protein in both non-transformed 
and H-Ras transformed cells. However, only in the transformed cells does this 
lead to the expected reduction of enzyme activity in the Ras cascade; both the 
amount of phosphorylated Raf and the activities of MEK-1, MEK-2, and MAP 
kinase were reduced in the tumor cells [27]. In addition, stimulation of the Ras 
signal pathway by EGF was not influenced in normal cells or in Src-transformed 
cells. Normal cells apparently have the ability to activate the signal pathway 
independently of Ras inhibitory drugs. In fact, further experiments have shown 
that farnesylation of the K-Ras protein is considerably less influenced by the 
inhibitors than that of H-Ras. K-Ras is also a substrate of the GGTase I and can 
therefore be geranylgeranylated if activity of the FTase is reduced. Normal cells 
and Src-transformed cells, in contrast to H-Ras-transformed cells, are able to 
compensate for loss of a Ras protein or an enzyme important for normal cell 
growth, by activation of alternative pathways. 

Fig. 2. FTase inhibitors in which the AA dipeptide was replaced (7: R = H; 8: R = CH3; 
9: Cys-3-AMBA-Met; 10: R = H, CH3) 

The two aliphatic amino acids have also been replaced by the hydrophobic 
spacers 3-(aminomethyl)benzoic acid (3-AMBA; see 9, Fig. 2) [28] and 3- and 
4-aminobenzoic acid (3- and 4-ABA) as well as 2-phenylaminobenzoic acid 
(see 10, Fig. 2) [29, 30]. These compounds displayed IC50 values in the nanomolar 
range, and 10 (R = H) blocked the growth in nude mice of a human lung car- 
cinoma expressing oncogenic Ras [31]. 

Further design led to replacement not only of the central AA-dipeptide but 
also of the AAX-tripeptide. Thus, biphenyl derivative 11 [32] and piperazine 
analogue 12 [33] were developed (Fig. 3). They display IC50 values in the nano- 
molar range, whereas geranylgeranyltransferase was blocked with micromolar 
IC50 values. Both compounds disrupt Ras processing in cells and piperazine 
derivative 12 suppresses tumor growth by 75% at a dose of 40 mg/kg. 




Fig. 3. FTase inhibitors in which the AAX tripeptide was replaced 



4.2 

Bisubstrate Inhibitors 

The design of FTase inhibitors based on the structure of farnesyl pyrophosphate 
has been pursued with less intensity due to the possible nonselective effects of 
competing with other enzymes such as squalene synthetase that also accept 
farnesylpyrophosphate as substrate [3, 4, 9, 10-12]. 

However, bisubstrate inhibitors incorporating a farnesyl- and a CAAX 
mimetic are very promising since one can expect that they display enhanced 
activity and selectivity. The phosphinic acids 13 (R = H) and 14 (R = Me, Fig. 4) 
are examples of bisubstrate analogues. 

Indeed, 13 is an effective in vitro inhibitor and the prodrug 14 has activity in 
H-Ras- and - to a lesser extent - K-Ras-transformed cells [34]. Prodrug 14 also 
inhibits growth of malignant cells of the neurofibromatosis type I which pre- 
sumably contain overactive wild type Ras due to the absence of active neuro- 
fibromin GAP [35]. These highly promising results are tempered, however, by 
the lower bioavailability and in vivo activity in comparison to the peptido- 
mimetics. Recently, benzyloxycinnamoylamide 15 and two analogs thereof were 
reported as bisubstrate inhibitors of yeast FTase [36] where the benzoylcinna- 
moyl group mimics the farnesyl moiety. These compounds displayed IC50 values 
in the low micromolar range. 




Fig. 4. Bisubstrate inhibitors of FTase, including pepticinnamin E (16) 



Ras-Farnesyltransferase-Inhibitors as Promising Anti-Tumor Drugs 






The natural product pepticinnamin E (16) is also considered a bisubstrate 
inhibitor of FTase. The successful synthesis of pepticinnamin E sets the stage to 
prepare analogs of this peptidic compound and to delineate the important 
structural parameters responsible for its FTase-inhibiting activity. Initial studies 
revealed that the central tripeptide part and, in particular, the absolute con- 
figuration of the central chlorinated amino acid are decisive for the inhibitory 
activity [37]. The bisubstrate inhibitors developed so far (as well as some of the 
CAAX mimetics, see above) have the advantage of not containing a free cysteine 
SH-group and thus being more stable. 



4.3 

Inhibitors from Screening Natural Sources and Compound Libraries 

In addition to pepticinnamin E, several natural products have been identified as 
FTase inhibitors, including, e.g., manumycin (17) and analogues [38], fusidienol 
(18) [39], and the preussomerins, e.g., 19 [40] (Fig. 5; for further examples see 
[10]). In general, these compounds have been less potent than the CAAX-based 
mimetics, typically displaying IC50 values in the low micromolar or the high 
nanomolar range. 

High-throughput screening of compound libraries was used to identify 
further inhibitors of farnesyltransferase. Thus a highly potent pentapeptide 
20 was identified (Fig. 6) that inhibited FTase with IC50 = 17 nmol/l and that 
antagonized Ras in Xenopus oocytes [41]. Systematic derivatization and trunca- 
tion of the peptide backbone finally led to a dipeptide derivative 21 that inhibit- 
ed Ras processing in cells and increased the life span of tumor-bearing nude 
mice by 35% at 200 mg/kg [10, 42]. 

In particular, a series of completely non-peptidic nonsulfhydryl FTase in- 
hibitors 22 and 23 was uncovered (Fig. 6). 

These chlorobenzocycloheptapyridines display pronounced selectivity for 
FTase over GGTase I. They show improved in vivo anti-tumor activity and 
pharmacokinetic profiles in mice when administered orally and inhibit H-Ras 
processing in Cos monkey kidney cells [43, 44]. Extensive variation of the sub- 
stituents on the piperidine moiety and the tricyclic ring system led to the devel- 
opment of compound 24 which inhibits FTase with IC50 = 40 nmol/l and is in- 
active toward GGTase I. 

Fig. 5. FTase inhibitors from natural sources (manumycin A 17, fusidienol 18, preussomerin G 19) 

Fig. 6. FTase inhibitors identified from compound libraries 

5 

Lack of Toxicity to Normal Cells 

Although FTase inhibitors influence the farnesylation of Ras they are likely to 
interfere with the posttranslational modifications of other CAAX-containing 
proteins as well. Apart from the approximately 20 farnesylated proteins that are 
known today, farnesylation is also required for normal Ras function which in 
turn is critical for normal cell viability. For these reasons farnesyltransferase 
inhibitors were thought to be potentially toxic. However, these drugs proved to 
be surprisingly non-toxic and did not display inhibitory activity against rodent 
fibroblast cells [45]. Farnesyltransferase inhibitors not only inhibited the growth 
of transformed cells in culture far more than the growth of normal cells, but also 
showed surprisingly few side effects in the treatment of tumor-bearing animals, 
with no obvious toxicity in normal tissues following treatments of up to six 
weeks. 

There are several possible explanations to account for this apparent lack of 
toxicity. Some geranylgeranylated Ras-related proteins might compensate for 
the loss of Ras function (see, e.g., [46]). Alternatively, inhibition of farnesyl- 
transferase may reduce Ras activity below the level required for transforma- 
tion, yet allow sufficient Ras activity for maintaining normal cell viability [47]. 
As a further possibility, a different signaling pathway may be activated when Ras 
is not anchored to the plasma membrane. 

Toxicity and efficacy studies have often been performed in rodent fibro- 
blast cells containing oncogenic H-Ras. However, prenylation of K-Ras 4B and 
N-Ras is not as effectively blocked by the farnesyltransferase inhibitors 
as that of H-Ras [48] (see below). Thus normal cells may be less sensitive to these 
drugs because they express K-Ras 4B and N-Ras. In this context it should be 
noted that H-Ras mutations are relatively uncommon in human tumors [49]. 
Rather, the K-Ras gene is the most frequently mutated in solid human cancers, 
whereas N-Ras is prevalent in leukemias. Thus the preclinical evaluation of 
the farnesylation inhibitors has yet to be critically re-evaluated for trials in 
humans. 

6 

Mechanism of Action 

Intensive investigations of a variety of FTase inhibitors have clearly revealed that
these drugs demonstrate antiproliferative activity in cell culture against a
variety of transformed cell lines and against tumors in animals. In particular, cells
with a mutant H-Ras protein are sensitive to these drugs. Although the growth
of K-Ras 4B transformed cells is also blocked, much higher concentrations
are needed for this effect since this protein is a much better substrate for FTase
than H-Ras [50]. Furthermore, the K-Ras 4B protein can more easily be
geranylgeranylated by GGTase, thus compensating for the lack of farnesylation [51, 52].
Since geranylgeranylated forms of Ras proteins can also potently transform
cells, farnesyltransferase inhibitors are not expected to inhibit the growth of
tumors with mutant K-Ras 4B. However, despite this alternative prenylation,
growth inhibition is observed. Evidently, blocking the farnesylation of one or
more proteins other than Ras is responsible for, or at least contributes to, the
observed effect. This is supported by the finding that growth of fibroblast
cells transformed by an N-terminally myristoylated (and thereby
membrane-anchored) Ras that cannot become farnesylated or geranylgeranylated is
inhibited by FTase inhibitors as well [53].

These findings have raised the question of which additional proteins are the
molecular targets of FTase inhibitors. One class of candidates involves members
of the Rho family of proteins, in particular Rho B (see Sect. 4.1). Rho B is a
particularly attractive target as Rho proteins have been shown to be required for
Ras transformation and Rho B is an immediate early gene response protein
activated by growth factor signaling. Rho B is targeted by FTase inhibitors (see
above and [54]). Another novel Ras-related farnesylated protein, Rheb, may also
be a target. Rheb modulates Ras function and is an antagonist of Ras
transformation. Its processing and membrane localization can be inhibited by an FTase
inhibitor [55].

Many of the biological studies with FTase inhibitors suggested that these 
agents are cytostatic and suppress cell growth rather than being cytotoxic in 
patients. However, farnesyltransferase inhibition was also observed to induce 
tumor regression of mammary and salivary carcinomas in a mouse tumor 
model [45]. The mechanism of shrinkage was unclear, but a recent study showed 
that an FTase inhibitor can induce apoptosis (cellular suicide) in cell culture, 
independent of the p53 tumor suppressor protein (a mediator of apoptosis) [56]. 
These observations may be clinically important because resistance to standard 
anti-cancer agents is often conferred by alterations in p53. Thus FTase inhibitors 
may still be efficacious against tumors containing mutant p53. 

Many of the questions concerning the biological activity of FTase inhibitors
have been addressed, although numerous questions remain to be answered. In
particular the consequences of the K-Ras 4B geranylgeranylation bypass and the
uncertainty about the cellular targets of these drugs deserve intensive
investigation. Nonetheless it is clear and accepted that FTase inhibitors have proven
effective and shown significant anti-tumor activity against Ras-dependent tumors
with little or no whole animal toxicity in many animal tumor models. They have
now reached the stage of human clinical evaluation [42, 57]. The clinical trials
will provide answers to questions about efficacy, toxicity, development of
resistance, and mechanism of action of these compounds such that their full
therapeutic potential can be appreciated.

7 

Outlook 

Since Ras is the most frequent oncogene found in human tumors it has long
been considered an attractive target for anti-cancer therapy. One promising and
advanced approach is the inhibition of the enzyme farnesyltransferase, which
attaches the lipophilic isoprenyl group to the C-terminal cysteine of the mature
protein and is thus vital for Ras function. A number of efficient compounds have
been developed, showing inhibitory activities in vitro and in vivo down to the
low nanomolar range, accompanied by high selectivities. The inhibitory
properties, chemical stabilities, and bioavailabilities of these substances could be
optimized by chemical modification of the appropriate molecular structure.
Furthermore, FTase inhibitors show considerable tumor suppression properties
that are, in some cases, even superior to conventional anti-tumor agents. These
FTase inhibitors are non-toxic to normal cells and have proved their efficacy
in various phases of clinical trials. Because neither the exact
mechanisms of action nor the affected molecular targets are well understood
yet, it remains an important task to investigate the pathways of signal
transduction in order to open new, more effective ways in anti-tumor therapy.

The PLC class of enzymes has been studied extensively over the past 15-20 years because of
their involvement in signaling pathways in which extracellular messages are delivered to the
cell to induce a response. Of the PLC isoenzymes, the PI-PLCs have perhaps been examined in
the greatest detail because of their key role in initiating cellular response by hydrolyzing the
phosphodiester bond of phosphatidylinositols and their phosphorylated derivatives to release
the second messengers IP3 and DAG. However, the extended release of DAG that is critical to
maintaining the stimulatory response arises from hydrolysis of the more abundant
phosphatidylcholine by PC-PLC or by PLD followed by phosphatidic acid phosphatase. Because no
eukaryotic PC-PLC has been cloned or isolated in pure form, the
phosphatidylcholine-preferring PLC from B. cereus (PLCBc) has emerged as a focal point for investigation and as a
putative model for mammalian PC-PLCs. The similarity of the active site of PLCBc with that of other
phosphoryl transfer enzymes has also served as a stimulus for mechanistic studies. The present
account details recent studies of this important member of the PLC superfamily of enzymes.

1

Introduction 

The view of phospholipids and phospholipases has evolved considerably over the
course of the last 50 years. After their initial discovery, phospholipids were
accorded the important, if somewhat pedestrian, role of maintaining the structural
integrity of cell membranes. This perception of phospholipids as passive gatekeepers
of the cell persisted for decades. However, in the early 1980s, it was discovered that
the hydrolysis of phospholipids in cellular membranes provided products that
stimulated a host of cellular functions, including those of calcium release and
enzyme activation [1]. Further experimentation indicated that some
phospholipases, which are the enzymes that catalyze the hydrolysis of phospholipids,
are activated in response to various extracellular signals [2]. Therefore, to send a
message from outside the cell to inside the cell, Mother Nature chose phospholipids
and their associated phospholipases as one means of transmitting the signal.

Although there are several classes of phospholipases, the phospholipase C
(PLC) family is the one that has recently received intense scrutiny in the general
context of signal transduction. The present account therefore details recent
structural and mechanistic studies of one member of this important
superfamily of enzymes, the phosphatidylcholine-preferring phospholipase C from
B. cereus (PLCBc).

2 

Classification and Importance of Phospholipases 

With their newly accorded significance in signal transduction and membrane
studies, the increase in knowledge about phospholipases has been remarkable.
Phospholipases are found in virtually every living organism, from bacteria to
plants to mammals, and a wealth of information is now available about the
various classes of phospholipases, which are categorized based upon which ester
or phosphodiester bond is hydrolyzed (Fig. 1). For example, phospholipase D is
specific for hydrolyzing the P-O bond to the polar headgroup of the
phospholipid to furnish an alcohol and phosphatidic acid (PA) (Fig. 1, Path 1).
Phospholipase C hydrolyzes the other phosphodiester bond to give diacylglycerol (DAG)
and a phosphorylated headgroup (Path 2). Phospholipase A1 (PLA1) and
phospholipase A2 (PLA2) catalyze the hydrolyses of the ester side chains at the sn-1 and
sn-2 positions, respectively, of the glycerol backbone (Paths 3 and 4).

Fig. 1. Cleavage sites and products for the hydrolysis of phospholipids by PLD, PLC, PLA1,
and PLA2

For a complex multicellular organism to function, cells must rapidly respond
to messages from other regions of the organism. These messages, often in the
form of hormones, arrive at the cell membrane through the aqueous environment
that comprises the bulk of living things. The message must then be relayed
into the cell to effect the appropriate response. Because this signal is transduced
across the cell membrane, the process is known as signal transduction, and the
entire sequence of interdependent events is referred to as the signal transduction
cascade. The cascade is typically initiated by binding of the hormone
to a receptor protein embedded in the membrane (Fig. 2). These receptors
typically span the membrane and have localized areas of hydrophilicity and
hydrophobicity. Upon binding the signaling entity, which can be a small molecule
(e.g., catecholamines) or a protein (e.g., insulin), a conformational change is
induced in the receptor. This reorganization allows an intracellular loop portion
of the receptor to contact a guanine-nucleotide-binding regulatory protein, or G
protein [3], which is comprised of α, β, and γ subunits [4].

When the receptor interacts with its associated G protein, the conformation
of the guanine-nucleotide-binding site is altered. The subunits then dissociate,
and a phosphatidylinositol-specific phospholipase C (PI-PLC) is activated [5].
The subsequent hydrolysis of phosphatidylinositol bisphosphate then produces
inositol trisphosphate (IP3) and diacylglycerol (DAG), which are known to be
secondary messengers. For example, the water soluble IP3 is released into the cell
where its ultimate targets are the calcium storage organelles from which Ca2+ is
released [3]. The presence of DAG in cells is known to activate the cellular enzyme
protein kinase C (PKC) [6, 7], which phosphorylates a number of cellular
proteins, thereby leading to their activation [2, 8]. Thus, the G-protein coupled
activation of PLC leads to the production of secondary messengers that relay the
extracellular message to the interior of the cell, thereby effecting a response.

Fig. 2. Activation of an intracellular PLC by an extracellular signal (primary messenger)
transduced across the membrane via a receptor and G-protein. The products DAG and IP3 that are
produced by the PLC catalyzed hydrolysis of phospholipids are secondary messengers

The production of DAG in the cell is not linear with time,
but rather it is biphasic. The first peak is rapid and transient and coincides with
the formation of IP3 and the release of Ca2+; this DAG is therefore derived from
the PI-PLC catalyzed hydrolysis of phosphatidylinositols [1]. There is then an
extended period of enhanced DAG production that is now known to be derived
from the more abundant phospholipid phosphatidylcholine (PC), which has a
different composition of fatty acid side chains [9]. Although DAG may be
generated directly from PC through the action of PC-PLC, it can also be formed
indirectly from PC. In this pathway, PC is first hydrolyzed by PLD to give choline
and phosphatidic acid, which is then converted to DAG by the action of a
phosphatidic acid phosphatase [10, 11].

Owing to their pivotal role in mammalian signal transduction, there has been 
an intense interest in the enzymes of the PLC superfamily. Progress toward 
understanding the mechanism, structure, and function of PI-PLCs from both 
bacterial and mammalian sources has been particularly impressive [12-15]. 
Several PI-PLCs have been isolated and cloned, and a number of high resolu- 
tion, three-dimensional X-ray structures are available [16-19]. In contrast to 
the advances that have been made with mammalian PI-PLC isoenzymes, their 
PC-PLC counterparts are poorly characterized. Studies with mammalian 
PC-PLCs have typically been conducted with partially purified enzymes, and 
there has not been a report of the isolation of a pure, eukaryotic PC-PLC. To 
circumvent the currently intractable problems associated with mammalian 
PC-PLCs, PLCs from bacterial sources have been sought as potentially useful 
models. 

The history of bacterial phospholipase C activity dates back to 1948, when
bacterial strains from the Bacillus genus were screened for the hydrolysis of
phosphatidylcholine on agar plates [20]. B. cereus gave a strong positive response,
whereas B. anthracis and B. mycoides exhibited weaker activity. Subsequent
culturing of B. cereus yielded an active phospholipase that is now known as the
phosphatidylcholine-preferring phospholipase C from Bacillus cereus (PLCBc)
[21]. This enzyme was ideal for further study because it was readily available in
pure form. Although the in vivo function of PLCBc has not been established,
it appears, like alkaline phosphatase and sphingomyelinase, to be involved in
phosphate retrieval when B. cereus is grown under conditions of limited
phosphate [22]. The discovery that PLCBc had an antigenic similarity with a
mammalian PC-PLC prompted a number of experiments in which PLCBc was exploited
to simulate PC-PLC activity [23]. For example, PLCBc can stimulate DAG
production in vivo [24], and PLCBc has been employed to investigate the
mechanisms of oncogenesis [25, 26] and membrane damage [27].

Thus, PLCBc has emerged as a potentially useful model for the presently
inaccessible mammalian PC-PLCs and as a tool for studying different facets of
PLC-mediated cell signaling. The present account will summarize some of the
recent advances toward understanding the function, structure, and mechanism
of PLCBc.

3

Aspects of Phospholipid Hydrolysis by PLCBc

One of the first steps toward developing an understanding of the functional
features of an enzyme catalyzed process is to study the efficiency with which it
processes different substrates. In the case of PLCBc there are a number of
variables in substrate structure whose effect upon catalytic activity and efficiency
must be examined. These include: (1) the importance of the stereochemistry in
the glycerol backbone; (2) the nature of the functional links between the side
chains and the glycerol backbone; (3) the effect of varying the length of the acyl
side chains; (4) the effect of varying the headgroup; and (5) the effect of altering
the nature of the phosphodiester moiety. During the course of such studies, it is
often possible to identify substrate analogues that are inhibitors, and these may
be useful tools for studying the mechanism of the enzymatic reaction and for
determining the biological effects upon inhibiting the enzyme in vivo.



3.1 

Assays for PLCBc

Prior to being able to study the function and mechanism of an enzyme, it is
essential that suitable assays be available to monitor enzyme activity toward
different substrates and to determine the kinetic parameters for the
reactions. A brief overview of the known assays for the evaluation of PLCBc
activity is thus appropriate. The ideal assay for a phospholipase C would utilize
a phospholipid substrate, not an analogue with a modified headgroup or side
chains. Such an assay should be sensitive to minimize the quantities of enzyme
and substrates that would be required, and it should be convenient to implement
so that analyses may be readily performed.

A titrimetric assay of PLCBc, alternatively called the pH-stat method, was the
workhorse in early studies [28]. This method simply involves titrating the acidic
product of the PLC reaction as it is formed with a solution of standard base. An
advantage of this continuous assay is that it can be used to detect the turnover
of both synthetic and natural substrates, and its sensitivity has been estimated
to be in the 20-100 nmol range. However, the pH-stat assay has low throughput
capability, and it cannot be easily performed in a parallel fashion with multiple
substrate concentrations. It is also necessary to exclude atmospheric carbon
dioxide from the aqueous media containing the enzyme and substrate.

Because neither product in the PLCBc catalyzed hydrolysis of natural
phospholipids is chromogenic, non-natural substrates have been designed and
developed explicitly for the spectrophotometric analysis of enzyme activity (Fig. 3).
Hydrolysis of p-nitrophenyl phosphorylcholine (NPPC) by PLCBc produces the
p-nitrophenolate anion, which gives a yellow color at pH 8.0. Although using
NPPC as a substrate allows for the facile evaluation of PLCBc activity, NPPC
is a very poor substrate for PLCBc [29, 30]; hence the sensitivity of this assay
is low. Another non-natural substrate is
1,2-dioctanoyl-sn-(3-thioglycero)phosphocholine [31], hydrolysis of which by PLCBc liberates a free thiol that
may be easily quantitated with 5,5'-dithio-bis(2-nitrobenzoic acid) (DTNB,
Ellman's reagent) [32]. Unfortunately, PLCBc processes this thiophospholipid
100 times slower than a natural phosphatidylcholine, so this assay is also of
limited utility.

Fig. 3. Non-natural substrates for PLCBc: p-nitrophenyl phosphorylcholine and
1,2-dioctanoyl-sn-(3-thioglycero)phosphocholine

A more successful strategy for developing sensitive and facile assays to monitor
PLCBc activity involves converting the phosphorylated headgroup into a
colorimetric agent via a series of enzyme coupled reactions. For example,
phosphatidylcholine hydrolysis can be easily monitored in a rapid and sensitive
manner by enzymatically converting the phosphorylcholine product into a red
dye through the sequential action of alkaline phosphatase, choline oxidase,
and peroxidase [33]. This assay, in which 10 nmol of phosphorylcholine can be
readily detected, may be executed in a 96-well format and has been utilized in
deuterium isotope and solvent viscosity studies [34] and to evaluate inhibitors
of PLCBc [33] and site-directed mutants of PLCBc [35, 36].

Because the preceding chromogenic assay relies on choline quantitation, the
hydrolysis of substrates with headgroups other than choline cannot be followed.
To circumvent this problem, another useful protocol was devised whereby the
phosphorylated headgroup produced by the PLCBc hydrolysis is treated with
alkaline phosphatase (APase), and the inorganic phosphate (Pi) that is thus
generated is quantitated by the formation of a blue complex with ammonium
molybdate/ascorbic acid; 5 nmol of phosphate may be easily detected. This assay,
which may also be performed in a 96-well format, has been utilized to determine
the kinetic parameters for the hydrolysis of a number of substrates by PLCBc [37, 38].

Several other methods, including 31P NMR [39, 40] and radiometric assays
[28, 41], have also been used to monitor the PLCBc catalyzed hydrolysis of
phospholipids. The assays that are currently available for PLCBc are compared in
Table 1.

Table 1. Comparison of common assays for phospholipase C

Assay method           Natural      Rapid and   Sensitive and   Use with variety
                       substrate?   simple?     accurate?       of headgroups?

Titrimetric            Yes          No          Yes             Yes
NPPC                   No           Yes         No              No
Thiophosphate          No           Yes         No              Yes
Choline quantitation   Yes          Yes         Yes             No
Pi quantitation        Yes          Yes         Yes             Yes
Radiometric            Yes          No          Yes             Yes
31P NMR                Yes          No          No              Yes

3.2 

Substrate Structure-Activity Relationships 

A number of experiments have been conducted to establish various aspects 
of structure-activity relationships (SAR) of substrates. Although the greatest 
emphasis has been upon the nature and orientation of the side chains on the 
glycerol backbone, variations in the phosphate headgroup have also been explored. 



3.2.1 

The Side Chains 

The glycerol side chains of the phospholipid substrate play an important role in
recognition and binding to the enzyme, and variations in the orientation, nature,
and length of these side chains can have dramatic effects on PLCBc activity.
Natural phospholipids exist in the R stereochemical configuration at the sn-2
center, and the corresponding S-isomer is hydrolyzed 40 times slower than the
natural phospholipid (Table 2) [42]. The introduction of methyl groups adjacent
to the ester carbonyl group provides phospholipid derivatives that are poorer
substrates [43]. Replacing the sn-1 and especially the sn-2 ester linkages with an
ether linkage gives analogues with low levels of enzymatic activity (Table 2) [43,
44]. Although it is at first somewhat surprising that removal of a carbonyl group
would so drastically affect activity, the crystal structure of PLCBc complexed
with a phosphonate inhibitor provides a rationale for the observed activity: the
sn-2 carbonyl group of the inhibitor is engaged in a hydrogen bond to the
backbone N-H of Asn134 [45]. Substrates lacking the sn-2 carbonyl function cannot
form this hydrogen bond, thereby resulting in lower kinetic activity. Thus, it
follows that the lyso analogue, which lacks the sn-2 side chain, is processed by
PLCBc approximately 1000 times slower than its diacyl parent [46]. On the other
hand, when the sn-1 side chain is removed, the resulting phospholipid analogue
is turned over at only a slightly reduced rate relative to the diacyl parent
(Table 2) [42, 44]. Sphingomyelin, in which the sn-2 ester is an amide and the
sn-1 carbonyl is removed, is not a good substrate for PLCBc [47].

The length of the acyl side chains of a phosphatidylcholine derivative has a
significant effect upon the rate at which PLCBc catalyzes the hydrolysis of
phospholipids [48]. As the length of the side chains increases, the critical micelle
concentration (CMC) of the substrate decreases, and there is a discontinuity in
the rate progression curve at the CMC. Thus, phosphatidylcholine derivatives in
micellar form are hydrolyzed about two to five times faster than when they are
in monomeric form [49-51]. The reason for this interfacial activation is not
fully understood, but several theories have been advanced. One possibility is
that the release of DAG is more facile into a lipophilic micelle than into aqueous
solution, so by accelerating the rate of departure of this leaving group, the
overall rate of turnover increases. This interpretation requires that DAG release is
rate-limiting in the PLCBc reaction of micellar substrates [49]. However, it has
been shown that DAG release is not rate determining in reactions of soluble
substrates at concentrations below their CMCs [34].

Table 2. Relative rates of hydrolysis of phosphatidylcholine derivatives by PLCBc
(columns: substrate, relative activity, reference)

Another interesting observation is that as the length of the side chain
decreases the Kms of monomerically-dissolved phosphatidylcholines increase
significantly, while Vmax changes very little (Table 3) [43, 52]. NMR studies of
phosphatidylcholines possessing three, four, and six carbons in each of the acyl
side chains have been conducted to determine whether these variations in Km
might arise from differences in the solution conformation of the substrate [52].
However, the length of the side chain had little effect upon the relative
orientations of the substituents on the glycerol backbone, so it appears that factors
other than ground state conformational preferences of the phospholipid are
responsible for the observed variations in Km. Such factors might involve
decreased hydrophobic interactions between the enzyme and the substrate and/or
differential solvation/desolvation effects associated with substrate binding.

Table 3. The effect of acyl chain length on PLCBc activity toward soluble phosphatidylcholine
derivatives. The differences in the experimental data are likely due to the use of different assay conditions

Substrate      Vmax              Km           kcat/Km       Reference
               (μmol/min·mg)     (mM)         (1/(s·mM))

(structure)    2290              61           18            [52]
(structure)    1430 (750)        1.2 (40)     32            [43, 52]
(structure)    2100 (1000)       2.4 (0.36)   417           [43, 52]
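The practical consequence of a large Km at nearly constant Vmax can be illustrated with the Michaelis-Menten rate law, v = Vmax[S]/(Km + [S]). The following is a minimal sketch, not a treatment from the original studies: it takes the Vmax and Km pairs 2290/61 and 2100/2.4 as read from Table 3, assumes (in line with the trend described above) that the entry with Km = 61 mM is the shorter-chain substrate, and uses an arbitrary illustrative substrate concentration of 1 mM, well below any CMC considerations.

```python
def michaelis_menten_rate(vmax, km, substrate_conc):
    """Initial rate v = Vmax*[S] / (Km + [S]); v carries the units of vmax."""
    return vmax * substrate_conc / (km + substrate_conc)


# Vmax in umol/min/mg and Km in mM, as read from Table 3.
# Which row is the shorter chain is an assumption based on the stated trend.
short_chain = {"vmax": 2290.0, "km": 61.0}
long_chain = {"vmax": 2100.0, "km": 2.4}

substrate_mM = 1.0  # arbitrary illustrative concentration, not from the text

v_short = michaelis_menten_rate(short_chain["vmax"], short_chain["km"], substrate_mM)
v_long = michaelis_menten_rate(long_chain["vmax"], long_chain["km"], substrate_mM)

print(f"short chain: v = {v_short:.1f} umol/min/mg")
print(f"long chain:  v = {v_long:.1f} umol/min/mg")
print(f"rate ratio (long/short) = {v_long / v_short:.1f}")
```

Although the two Vmax values differ by less than 10%, the sub-saturating rates differ by more than an order of magnitude, which is why the Km trend dominates the catalytic efficiency of the soluble substrates.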



3.2.2 

Modifications of Polar Headgroup 

Although the major focus of substrate SAR studies has been on phosphatidyl- 
choline derivatives, phospholipids with other headgroups have also been analyz- 
ed. In a study of naturally occurring phosphatidylcholines, phosphatidyl- 
ethanolamines (PE), and phosphatidylserines (PS) having acyl side chains 
ranging in length from 12 to 18 carbon atoms each, PE and PS exhibited 
that were approximately 25 % that of PC [53]. However, it should be emphasized 
that these compounds had different side chain lengths, so it was not possible to 
compare directly the effect of having different headgroups. 

Recently, a series of soluble phospholipids with n-hexanoyl side chains and
a variety of headgroups were assayed as substrates for PLCBc [37]. The results
(Table 4) indicate that both 1,2-dihexanoyl-sn-glycero-3-phosphocholine
(C6PC) and 1,2-dihexanoyl-sn-glycero-3-phosphoethanolamine (C6PE) are
processed with similar catalytic efficiencies (kcat/Km) by the enzyme, while
1,2-dihexanoyl-sn-glycero-3-phosphoserine (C6PS) has a roughly tenfold lower kcat/Km.
The unnatural substrate 1,2-dihexanoyl-sn-glycero-3-phospho-3',3'-dimethyl-1-butane
(C6PDB), which has a headgroup isosteric with that of C6PC but
lacking the positive charge, is hydrolyzed by PLCBc about 1000 times less
efficiently than C6PC [38]. Thus, a positive charge on the phospholipid headgroup
is important for recognition and catalysis.

Table 4. Kinetic values for the hydrolysis of phospholipids with various headgroups by PLCBc

Substrate   Vmax              kcat     Km         kcat/Km       Reference
            (μmol/min·mg)     (1/s)    (mM)       (1/(s·mM))

C6PC        2100±150          1000     2.4±0.4    417           [38]
C6PE        1130±100          540      1.8±0.2    300           [38]
C6PS        435±85            210      4.5±1.0    47            [38]
C6PDB       11                5.2      14         0.37          [37]
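The efficiency comparisons quoted above follow directly from the kcat and Km entries of Table 4. As a minimal sketch (the dictionary layout and function name are illustrative choices, not part of the original work), the ratios can be recomputed as follows:

```python
# kcat (1/s) and Km (mM) for each substrate, as listed in Table 4.
table4 = {
    "C6PC": (1000.0, 2.4),
    "C6PE": (540.0, 1.8),
    "C6PS": (210.0, 4.5),
    "C6PDB": (5.2, 14.0),
}


def catalytic_efficiency(kcat, km):
    """Catalytic efficiency kcat/Km in 1/(s*mM)."""
    return kcat / km


efficiency = {
    name: catalytic_efficiency(kcat, km) for name, (kcat, km) in table4.items()
}

# How many times more efficiently C6PC is processed than each substrate.
fold_below_c6pc = {
    name: efficiency["C6PC"] / value for name, value in efficiency.items()
}

for name in table4:
    print(
        f"{name}: kcat/Km = {efficiency[name]:.2f} 1/(s*mM), "
        f"C6PC/{name} = {fold_below_c6pc[name]:.1f}"
    )
```

The recomputed values reproduce the tabulated kcat/Km column after rounding, with C6PS about ninefold and C6PDB roughly 1100-fold below C6PC, consistent with the "roughly tenfold" and "about 1000 times" statements in the text.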



3.2.3 

Modifications of Phosphate Group 

The constellation of atoms about the phosphorus of the phosphodiester linkage
has been varied by replacing the bridging and non-bridging oxygens with sulfur,
carbon, and nitrogen to give compounds that may be either substrates or
inhibitors of PLCBc. For example, substitution of the bridging oxygen in the scissile
oxygen-phosphorus bond with a sulfur atom gives a substrate that is turned over
very slowly, whereas replacing this oxygen with NH, NBn, CH2, or CF2 provides
competitive inhibitors (Table 5) [42, 54]. The structure of PLCBc complexed with
a phosphonate inhibitor was elucidated by X-ray crystallographic analysis [45].
When both non-bridging oxygens are replaced with sulfur, a potent inhibitor of
PLCBc activity is obtained [54].

Table 5. The effect upon PLCBc activity of replacing bridging and non-bridging oxygens around
phosphorus in soluble phosphatidylcholine derivatives

3.3 

Zinc Binding Inhibitors 

There are inhibitors of PLCBc that are substrate analogues and others that are
non-substrate analogues. As evidenced in the preceding section, inhibitors that are
substrate analogues contain non-hydrolyzable replacements of the phosphate group.
One group of non-substrate analogues that are PLCBc inhibitors exploit their zinc
binding ability as the basis for PLCBc inhibition; such inhibitors interact with
one or more of the three active site zinc ions. One important inhibitor of this
type is tricyclodecan-9-yl-xanthogenate (D609) (Fig. 4), a commercially
available compound that inhibits PLCBc (Ki = 6.4 μmol/l) [55]. D609 is known to
prevent the activation of protein kinase C and to have anti-tumor and anti-viral
properties [56], and it has recently been used in numerous studies that implicate
mammalian PC-PLC participation in various signaling pathways such as those
involving interleukin production and apoptosis [57-59]. However, experiments
using D609 must be interpreted with caution because it also affects PLD
activity [24].

Fig. 4. Non-substrate analogues as inhibitors of PLCBc: D609, 2,7-dihydroxytropolone, and
1,3-N,N'-dihydroxy-5,6-dihydro-1,3-diazepin-2,4-dione

The zinc binding ability of small molecules has been exploited as a strategy
for discovering other PLCBc inhibitors, although a potential limitation of this
approach is that inhibitors so identified may not be selective. For example, it was
recently found that 2,7-dihydroxytropolone (Fig. 4), which is an inhibitor of the
bimetallic hydrolases alkaline phosphatase and inositol monophosphatase [60], is
also a strong inhibitor of PLCBc, having a Ki of ~16 μmol/l at pH 7.3 [61]. Cyclic
N,N'-dihydroxy ureas also weakly inhibit PLCBc, as illustrated by
1,3-N,N'-dihydroxy-5,6-dihydro-1,3-diazepin-2,4-dione (Fig. 4), which exhibited a Ki of
388 μmol/l at pH 7.3 and 53 μmol/l at pH 9.5 [61]. The pKas of the N-hydroxyl
groups in this and other cyclic N,N'-dihydroxy ureas played a key role in inhibitor
potency, presumably because an ionized N-hydroxyl function would be
expected to bind to a zinc ion better than a neutral hydroxyl group. Whether tropolones
or cyclic dihydroxy ureas will prove to be useful and selective inhibitors of
PLCBc remains to be demonstrated.
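The proposed link between ionization and potency can be made concrete with the Henderson-Hasselbalch relation. The sketch below is illustrative only: it treats a single N-hydroxyl group as a simple monoprotic acid, and the pKa of 8.5 is a hypothetical value chosen for demonstration, since the measured pKas are not given here. Only the two Ki values are taken from the text.

```python
def fraction_ionized(pka, ph):
    """Fraction of a monoprotic acid present as its conjugate base at a given pH."""
    return 1.0 / (1.0 + 10.0 ** (pka - ph))


ASSUMED_PKA = 8.5  # hypothetical value for illustration, not from the text

ionized_at_ph73 = fraction_ionized(ASSUMED_PKA, 7.3)
ionized_at_ph95 = fraction_ionized(ASSUMED_PKA, 9.5)

# Ki values (umol/l) quoted in the text for the cyclic dihydroxy urea.
ki_ph73 = 388.0
ki_ph95 = 53.0
potency_gain = ki_ph73 / ki_ph95

print(f"fraction ionized at pH 7.3: {ionized_at_ph73:.3f}")
print(f"fraction ionized at pH 9.5: {ionized_at_ph95:.3f}")
print(f"observed gain in potency (Ki ratio): {potency_gain:.1f}")
```

Under this assumed pKa the ionized fraction rises steeply between the two pH values, in the same direction as the roughly sevenfold drop in Ki; a quantitative comparison would require the measured pKas and would have to account for pH effects on the enzyme itself.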

4 

Structural Features of PLCBc and its Complexes

Protein X-ray crystallography is one of the best means of gaining insights that
are necessary to begin to understand the function, substrate specificity, and
mechanism of a protein. Hence, the X-ray analysis of PLCBc set the stage for
numerous structural and mechanistic studies. The high resolution (1.5 Å)
structure of PLCBc showed it to be a monomeric protein containing three zinc ions in
its active site [62]. The enzyme is roughly ellipsoidal in shape, 40 × 30 × 20 Å,
and about 66% of the amino acids are located in ten α-helical regions with the
remainder of the residues being in flexible loops connecting the helices. The
surface of the protein is relatively smooth except for an 8 Å deep by 5 Å wide cleft
at the active site, which is readily apparent in the complex of PLCBc with a
phosphonate inhibitor (Fig. 5) [45]. These and other structures of PLCBc complexes
have revealed a number of important features of the enzyme and the manner in
which it binds substrate analogues. Of particular interest are the novel metal
binding site, the choline binding pocket, and the interactions of the enzyme with
the lipid side chains.

Fig. 5. View of PLCBc bound to a phosphonate substrate analogue showing protein backbone
(cyan), zinc ions (magenta), and inhibitor with carbons (white), oxygens (red), phosphorus
(orange), and nitrogen (blue) [45]



4.1 

Nature of Zinc Center 

Because of the unusual cluster of three zinc ions at its active site, the tri-metal
center of PLCBc has been the subject of considerable interest. The amino acid
side chains and water molecules that comprise the first coordination sphere of
these zinc ions in the native structure are represented schematically in Fig. 6.
All of the zincs are pentacoordinate, and each adopts an approximate
trigonal-bipyramidal geometry. The distance between Zn1 and Zn3 is 3.6 Å, qualifying
this pair as a bi-metallic center [63]. The side chain carboxyl group of Asp122 and a
water molecule bridge these two ions. The remainder of the Zn3 coordination sphere
is filled by the amino group and backbone carbonyl of the N-terminal Trp1 and
the ε-nitrogen of His14. The Zn1 ion in turn is ligated to His69, His118, and
Asp55. The constellation of two histidines (His128 and His142) and a glutamic
acid (Glu146) about Zn2 resembles that of a classic zinc site. The importance of
His118, Glu146, His128, His142, Trp1, and His14 as zinc binding ligands has been
supported by a series of site-directed mutagenesis studies in which each of these
residues was replaced with Ala; each mutant bound only two zinc ions [35, 64].

Fig. 6. Zinc ligands in the native PLCBc structure [62]

Inasmuch as structures of enzymes complexed with substrate-like or product-like
ligands can reveal useful information about the mechanism of the enzyme-catalyzed
reaction, crystals of PLCBc were soaked with phosphorylcholine,
phosphorylethanolamine, and phosphorylserine, but no complexes were formed
[65]. Tris, which is a common biological buffer, inhibits PLCBc activity by 50% at
5 mmol l⁻¹ and 100% at 50 mmol l⁻¹, and the structure of a complex of PLCBc with
Tris has been obtained. In this complex, the amine nitrogen of Tris is coordinated
to Zn2, thereby replacing a water molecule, and the three hydroxyl groups of
Tris are involved in a hydrogen bonding network with the protein [66]. This
binding motif places the Tris molecule at the opening of the PLCBc active site,
blocking access of substrate to Zn1 and Zn3.

From the mechanistic viewpoint, a more informative structure is the complex
of PLCBc with inorganic phosphate (Pi), which also inhibits the enzyme (50%
inhibition at 50 mmol l⁻¹ Pi) [65]. In this complex, which was obtained at 2.1 Å
resolution, the three zinc ions were coordinated to the two non-bridging
phosphate oxygens (Fig. 7a). Comparison of this structure with the native one (Fig. 6)
reveals that one of the non-bridging oxygens on the phosphate replaced the
bridging water molecule between Zn1 and Zn3, while the other oxygen displaced
one of the waters coordinated to Zn2.



Fig. 7a, b. Active site showing Zn1–Zn3 and Zn3–Asp55 distances in complexes of PLCBc with:
a inorganic phosphate [65]; b a phosphonate inhibitor [45]



A major advance that led to a much better understanding of how substrate
might bind to PLCBc was a seminal report detailing the structure of the complex
of PLCBc with the phosphonate inhibitor 3(S),4-dihexanoylbutyl-1-phosphonylcholine
(Ki = 1.15 mM) (Fig. 5) [45]. This structure revealed that the phosphonate
moiety bound to the active site zinc ions in a manner similar to inorganic
phosphate (Fig. 7b). The non-bridging oxygens of phosphorus were
coordinated to all three zinc ions, replacing the Zn1–Zn3 bridging water and
one water on Zn2. In this complex, however, the second water that was bound to
Zn2 in the native enzyme is absent, making this ion tetracoordinate. The coordination
state of Zn2 may provide a clue to how PLCBc catalyzes the hydrolysis of
phospholipids, and this issue will be revisited in the context of mechanistic
studies (see Sects. 5.2.1.4 and 5.2.2.1).

There are only minor differences between the structures of native PLCBc and
its complexes with inorganic phosphate and the phosphonate inhibitor. However,
one of these may be significant. In the X-ray structure of the uncomplexed
enzyme, one of the carboxyl oxygens of Asp55 is 2.3 Å from Zn1 (Fig. 6), whereas
it is about 2.70 Å in both the phosphate and phosphonate complexes (Fig. 7).
This shift is accompanied by a decrease in the distance between Zn1 and Zn3
from about 3.6 Å in the native structure to about 3.5 Å in the complexed
structures. Although the reasons for these changes are unknown, the observation that
Asp55 is more loosely coordinated to Zn1 in the complexes than in the native
enzyme may have a bearing on the potential role of Asp55 as the general base in
the mechanism of PLCBc-catalyzed hydrolysis of phospholipids (see Sect. 5.2.1.3).

While having three metal ions in an enzyme active site is uncommon, it is
not unique to PLCBc. The well-known alkaline phosphatase from E. coli (APase)
contains two zinc ions and a magnesium ion [67], whereas the α-toxin from
Clostridium perfringens [68] and the P1 nuclease from Penicillium citrinum [69]
each contain three zinc ions. Indeed, the zinc ions and coordinating ligands of
P1 nuclease bear an uncanny resemblance to those of PLCBc, as the only difference
is that the Glu ligand for Zn2 in PLCBc is an Asp in P1 nuclease. Because
PLCBc and P1 nuclease have virtually identical active sites, it is tempting to invoke
similar modes of binding and catalysis. However, there are significant structural
differences in these enzymes, and they bind substrate analogues in very different
fashions. For example, in a complex with the uncleavable diastereomer of a
thiophosphorylated dinucleotide, only Zn2 of P1 nuclease is coordinated with a
phosphate oxygen. The binuclear zinc site of Zn1 and Zn3 had a bridging water
molecule and appeared inaccessible [69]. In a subsequent X-ray study of P1
nuclease with other substrate analogues, one of the non-bridging phosphate
oxygens was coordinated to Zn2 while the other was electrostatically engaged
with the side chain of Arg48; again the bridging Zn1–Zn3 water remained in
place [70].

Fig. 8a, b. Proposed modes of substrate binding to: a PLCBc [45]; b P1 nuclease [70], based on
X-ray structures with non-hydrolyzable substrate analogues

Based upon these findings, it appears that P1 nuclease binds substrates in a
markedly different manner (Fig. 8b) than PLCBc (Fig. 8a), and the enzymatic
mechanism for the hydrolysis of phosphodiester bonds by the two is likely
different. PLCBc contains no residue analogous to Arg48 in P1 nuclease, so PLCBc
binds and neutralizes the negative charge of the substrate with three zinc ions.
On the other hand, P1 nuclease uses only one zinc and the guanidino group of
an arginine; this binding mode frees the remaining two zincs of P1 nuclease to
perform another function, possibly to activate the bridging hydroxyl group for
nucleophilic attack on the phosphodiester bond.

4.2 

The Choline Binding Site 

The substrate binding cleft of PLCBc that accommodates the choline headgroup
is lined on one side with Glu4, Asp55, Tyr56, and Glu146 and on the other with
Ser64, Thr65, Phe66, Ile80, Thr133, Leu135, and Ser143. Of particular interest is
the arrangement of Glu4, Tyr56 and Phe66 about the positively charged
trimethylammonium moiety. Namely, the side chain carboxyl group of Glu4 is
3.9 Å away from the partially positive polarized hydrogens on the methyl groups,
whereas the centroids of the aromatic rings of Tyr56 and Phe66 are approximately
4.7 Å and 4.2 Å, respectively, from these methyl groups (Fig. 9) [45]. The
questions that immediately arise are what are the roles of these different amino
acid side chains in stabilizing the positive charge on the phospholipid headgroup
and what contributions do these residues make to substrate binding and
specificity?

The choline binding pocket of PLCBc provides an interesting case study of
how the side chains of amino acids interact with ammonium ions in biological
systems. It seems reasonable to assume that the carboxyl group of Glu4 stabilizes
the positive charge by an electrostatic interaction. Both Tyr56 and Phe66 are
positioned so that one π-face of each of the respective aromatic rings, which
have partial negative character because of the quadrupole moment of the rings
[71], is coordinated with the trimethylammonium ion to stabilize the positive
charge through electrostatic effects. Such π-cation interactions represent a
genre of protein-ligand recognition motifs that was not widely recognized until
about a decade ago [72], but there have since been a number of studies
documenting their importance [73-77].

A question at this stage is how do individual changes in the polar/hydrophilic
nature of the side chains at residues 4, 56, and 66 influence the kinetic
parameters kcat and Km and the catalytic efficiency, kcat/Km, for the hydrolyses of
PC, PE, and PS? To address this issue, a series of single mutants of Glu4 (E4Q, E4D,
E4L), Tyr56 (Y56W, Y56R, Y56A), and Phe66 (F66Y, F66W, F66A) were prepared by
PCR mutagenesis [78]. The mutant proteins, which were determined to be
structurally similar to the wild-type by temperature dependent CD and atomic
absorption, were assayed for their ability to catalyze the hydrolysis of the soluble
phospholipid substrates C6PC, C6PE, and C6PS.

Fig. 9. Stabilizing interactions of trimethylammonium ion of a substrate analogue with Glu4,
Tyr56, and Phe66 of PLCBc [45]

Table 6. Qualitative assessment of the contributions of Glu4, Tyr56, and Phe66 to the hydrolysis
of soluble phospholipid substrates PC, PE, and PS by PLCBc

PLCBc residue    Important for      Important for      Important for
                 PC processing?     PE processing?     PS processing?

Glu4             ++                 ++                 -
Tyr56            ++                 +                  +
Phe66            +++                +++                +++

+++, mutants retain <5% wild-type activity.
++, most mutants retain <15% wild-type activity.
+, most mutants retain <30% wild-type activity.
-, mutants retain >50% wild-type activity.

The relative importance of Glu4, Tyr56, and Phe66 in the PLCBc-catalyzed
hydrolyses of different phospholipids is qualitatively summarized in Table 6, but
a few more specific comments are warranted. The biggest changes in the kinetic
parameters were in kcat, which varied over nearly three orders of magnitude,
whereas changes in Km varied by less than a factor of ten. Mutations of Glu4 had
little effect on kcat and Km for the hydrolysis of C6PS, but they did influence the
kinetic parameters for C6PC and C6PE. Thus, a carboxyl group at the E4 position
is beneficial for the hydrolyses of C6PC and C6PE, which each have a positive
charge on the headgroup of the substrate, but not for the hydrolysis of C6PS,
which has no net charge on the headgroup. Interestingly, E4Q hydrolyzes C6PS
faster than wild-type. An aromatic residue at position 56 appears to confer some
substrate specificity on the enzyme, as Y56W is more selective toward C6PC and
C6PE relative to C6PS than the wild-type enzyme; this selectivity is largely
a consequence of a significant drop in kcat for C6PS. Mutations of Phe66 to
a non-aromatic residue led to dramatic drops in catalytic efficiencies that were
manifested in large decreases in kcat for each of the substrates. Although these
experiments have revealed some interesting relationships between “choline”
binding residues and catalysis, they are basically silent on binding, and additional
experiments, such as microcalorimetry of protein-ligand complexes, will be
required to gain insights on this important issue.
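
The legend of Table 6 amounts to a set of thresholds on the activity a mutant retains relative to wild type. A minimal Python sketch of that mapping is given here for illustration only. The function name `table6_score` is hypothetical, the legend says nothing about the 30-50% range (returned here as `None`), and the legend's "most mutants" qualifier is ignored, so this should not be read as the authors' scoring procedure.

```python
from typing import Optional


def table6_score(percent_wild_type: float) -> Optional[str]:
    """Map retained activity (% of wild type) to a Table 6 symbol.

    Thresholds are taken from the table legend. The legend does not cover
    the 30-50% range, so None is returned there (an assumption of this sketch).
    """
    if percent_wild_type < 0:
        raise ValueError("retained activity cannot be negative")
    if percent_wild_type < 5:
        return "+++"   # residue is critical: mutants retain <5% activity
    if percent_wild_type < 15:
        return "++"    # mutants retain <15% activity
    if percent_wild_type < 30:
        return "+"     # mutants retain <30% activity
    if percent_wild_type > 50:
        return "-"     # residue is unimportant: mutants retain >50% activity
    return None        # 30-50%: not defined by the legend


if __name__ == "__main__":
    for pct in (3, 10, 20, 40, 80):
        print(pct, table6_score(pct))
```

Read this way, a "+++" entry for Phe66 across all three substrates means that its mutants retain under 5% of wild-type activity in every case.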



4.3 

The Binding of Lipid Side Chains 

Although the stabilizing interactions between the amino acid side chains of
PLCBc and the choline headgroup are readily apparent in the PLCBc-phosphonate
inhibitor complex, it is more difficult to identify contacts between the protein
and the acyl chains of the inhibitor [45]. In part this is because thermal motion
in the acyl side chains, especially the sn-1 chain, renders them somewhat
disordered. Consequently, the measured distances between the side chain carbons
and enzyme residues cannot be taken as evidence for or against favorable
hydrophobic interactions. With this caveat in mind, there appear to be possible contacts
at distances in the range 3.9–4.5 Å between the side chains of Leu135, Phe66, and
Phe70 and the C(2)–C(6) segment of the sn-2 side chain. These interactions may
be important as determinants of enzyme activity because, in their absence, Km
might be expected to increase. Indeed, as the data in Table 3 indicate, decreasing
the length of the acyl side chains in phosphatidylcholine derivatives does result
in significant increases in Km.

One interaction between the enzyme and the phosphonate inhibitor that is
unambiguously significant involves the strong hydrogen bond (3.0 Å) between
the backbone amide of Asn134 and the carbonyl group of the sn-2 ester function.
This interaction provides a rationale for the observation that a simple ether
linkage results in a greatly diminished rate of PLCBc processing (see Sect. 3.2.1).
Presumably because phospholipid analogues lacking the sn-2 carbonyl group
cannot make this hydrogen bond, they are poorly recognized by the enzyme and
hence are poor substrates.

5 

PLCBc Mechanism

Before discussing the mechanistic aspects of the PLCBc-catalyzed hydrolysis of
phospholipids, a brief survey of the manner in which metal ions in enzyme
active sites participate in catalysis is warranted.



5.1 

Mechanisms of Metalloenzymes 

Two major roles that the Lewis acidity of metal ions can play include: (1)
coordination with the substrate to increase its electrophilicity and render it more
susceptible to nucleophilic attack and (2) coordination with an active site water,
thereby lowering its pKa. The subclass of metallohydrolases to which PLCBc
belongs is characterized by a bi-metallic or bi-nuclear center wherein two metal
ions in the active site are relatively close to one another (3–4 Å) [63]. These two
ions are bridged by a single amino acid ligand (typically a carboxylate) and a
water molecule. Due to the strong Lewis acidity of the two metal ions, this
bridging water likely exists as a hydroxide ion at physiological pH.

There are three mechanistic possibilities for catalysis by two-metal ion sites
(Fig. 10). The first of these is the classic two-metal ion catalysis in which one
metal plays the dominant role in activating the substrate toward nucleophilic
attack, while the other metal ion furnishes the bound hydroxide as the
nucleophile (Fig. 10a). Upon substrate binding, the previously bridged hydroxide shifts
to coordinate predominantly with one metal ion. Enzymes believed to function
through such a mechanism include a purple acid phosphatase [79], DNA
polymerase I [80], inositol monophosphatase [81], fructose-1,6-bisphosphatase [82],
BamHI [83], and ribozymes [63].

In a second scenario, the substrate is not necessarily coordinated to the metal
ions but rather is positioned by amino acid side chains at the active site. The
bi-metallic site then lowers the pKa of the bridging water to generate the
nucleophilic hydroxide ion, even at low pHs (Fig. 10b). Enzymes believed to use this
tactic of nucleophile activation include P1 nuclease [69, 70], leucine
aminopeptidase [84], and inorganic pyrophosphatase [85].

Fig. 10. Participation of a bi-nuclear metal site in enzymatic catalysis: (a) the substrate
coordinates to one metal ion while the other activates the nucleophile; (b) the substrate is
stabilized by amino acid side chains, and the metal ions provide the bridging hydroxide as the
nucleophile; (c) the bridging hydroxide is displaced by the substrate, and the nucleophile is
another water, which may be activated by an amino acid side chain in the active site

In the final case, the bridging hydroxide is displaced upon substrate binding
(Fig. 10c). The metal ions would then act as classic Lewis acids to polarize the
substrate, thereby activating it for attack. The nucleophilic water is not
coordinated with a metal ion in the bi-metallic center, but rather it would be activated
by an amino acid side chain that serves as a general base. Even though this
latter mode of substrate and nucleophile activation is possible, there presently
appears to be no well-documented case of a metalloenzyme with a bi-metallic
center that catalyzes hydrolysis in this fashion. It should be noted, however, that
an enzyme like PLCBc having three metal ions in the active site might operate by
a related mechanistic pathway in which the water is activated by the third metal
ion and/or an amino acid side chain.

A comparison of the X-ray crystal structures of the enzyme in its native state 
with that of a complex with an inhibitor can provide clues as to which of these 
three mechanistic possibilities is operative. In most cases the location of the 
bridging water in the inhibitor complex is diagnostic of the role of the metal ions 
in catalysis. For example, if this water is present in the inhibitor complex and if 
the inhibitor is coordinated to the metal, mechanism (a) is probable. However, if 
the water remains in place and the inhibitor is not coordinated to a metal ion,
mechanism (b) is more likely operative. Conversely, if the water is displaced 
upon inhibitor binding and the inhibitor is coordinated to one or both metal 
ions, then the reaction probably proceeds via mechanism (c). Of course, the 
usual caveat that such complexes are catalytically inactive should be borne in 
mind during this analysis. 
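
The diagnostic reasoning of the preceding paragraph can be written as a small decision function. The following Python sketch is only an illustration: the function and parameter names are hypothetical, and "coordinated" is taken here to mean coordination to the metals of the bi-nuclear site, which is an interpretive assumption rather than something the text states explicitly.

```python
def diagnose_mechanism(bridging_water_present: bool,
                       inhibitor_on_binuclear_metals: bool) -> str:
    """Suggest which mechanism of Fig. 10 an inhibitor complex points to.

    bridging_water_present: the metal-bridging water is still seen in the
        inhibitor complex.
    inhibitor_on_binuclear_metals: the inhibitor is coordinated to one or
        both metals of the bi-nuclear site (assumed meaning of "coordinated").
    """
    if bridging_water_present and inhibitor_on_binuclear_metals:
        return "a"  # one metal binds substrate, the other supplies hydroxide
    if bridging_water_present and not inhibitor_on_binuclear_metals:
        return "b"  # substrate held by side chains, bridging hydroxide attacks
    if not bridging_water_present and inhibitor_on_binuclear_metals:
        return "c"  # substrate displaces the hydroxide, another water attacks
    return "undetermined"  # combination not discussed in the text


if __name__ == "__main__":
    # PLCBc-phosphonate complex (Sect. 4.1): bridging water displaced,
    # inhibitor coordinated to the zinc ions.
    print(diagnose_mechanism(False, True))
    # P1 nuclease complexes (Sect. 4.1): bridging water retained, analogue
    # not bound to Zn1 or Zn3.
    print(diagnose_mechanism(True, False))
```

As the text cautions, such a rule is applied to catalytically inactive complexes, so its output is a hint about mechanism and not proof of it.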

5.2 

The Catalytic Mechanism of PLCBc

Because of its unusual active site and its potential as a model for mammalian
PC-PLC enzymes, there has been considerable interest in the mechanism of
the PLCBc-catalyzed hydrolysis of phospholipids. The few mechanistic points
that are commonly accepted are that a water must be activated for attack on
the phosphodiester linkage, the leaving group must be protonated, and the
products must leave the enzyme active site (Fig. 11). These issues will now be
examined.

Fig. 11. Necessary elements of the PLCBc mechanism. An active site water is activated for attack
on the phosphodiester, a proton is transferred to the departing DAG, and the products leave
the active site

5.2.1

Activation of the Nucleophile

One of the questions that is commonly addressed in mechanistic proposals is
how the active site water is activated for nucleophilic attack on the
phosphodiester bond. Numerous combinations of amino acid side chains and zinc ions
have been proposed for this role, but there has been little consensus. Critical to
all the general base hypotheses is a quite reasonable assumption about catalysis
by PLCBc: the nucleophilic attack on the phosphodiester moiety proceeds via an
in-line mechanism resulting in stereochemical inversion of configuration at
phosphorus [86]. While this assumption is consistent with the position of the
active site water molecules in the PLCBc-phosphonate inhibitor complex [45], it
has not yet been established experimentally. This structure provides a detailed
picture of how the amino acid side chains of Glu146, Glu4, Asp55, and the zinc
ions interact with the phosphonate inhibitor (Fig. 12), so mechanistic
hypotheses now have a structural basis.



5.2.1.1

Glu146 as a General Base

One mechanism that was proposed for the PLCBc reaction was based upon the
observation that an active site water (W1) was 2.5 Å from the side chain carboxyl
group of Glu146, which would then serve as the putative general base [45, 87]. In
this scenario Glu146 deprotonates W1, which is 4.4 Å from the phosphorus of
the inhibitor (Fig. 12). This water is opposite the DAG leaving group and in
position to perform an in-line, SN2(P)-type attack on the phosphodiester via
an associative mechanism [88]. However, activation of this water by Glu146 is
problematic because it requires that Glu146 function both as a ligand for Zn2
and as the general base. Although a similar dual role for a carboxyl group has
been proposed for DNA polymerase I [80] and inositol monophosphatase [81],
the ability of a carboxyl group to serve both functions has been openly questioned
because coordination with a zinc ion would lower its basicity [86].

Fig. 12. Distances (Å) of PLCBc active site waters from selected amino acid side chains and a
phosphonate substrate analogue [45]

To test experimentally whether Glu146 might be the general base, the single
mutants E146L, E146D, and E146Q of PLCBc were expressed in E. coli, isolated,
purified, and characterized [35]. The kinetic parameters kcat and Km were
determined by the choline quantitation method [33]. The mutants were found to
retain 0.2–1.6% of the wild-type activity, with the major difference in activity
being in kcat; the Km values were all similar to that of the wild type. Although
there was a significant decrease in catalytic efficiency (60–500-fold) in these
mutants, the loss in activity did not approach the 10,000–100,000-fold decrease
often observed upon removal of the general base. For example, removal of the
putative general base in the phosphoryltransfer enzymes triosephosphate iso- 
merase [89] and fructose-1, 6-bisphosphatase [82] gave enzymes with a 10'’ and 
10'* reduction in activities, respectively. 
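
The percent-activity and fold-decrease figures quoted above for the E146 mutants are two views of the same ratio, and the conversion is easy to check. The short Python sketch below is offered only as a worked illustration; the helper names `fold_decrease` and `percent_retained` are hypothetical.

```python
def fold_decrease(percent_of_wild_type: float) -> float:
    """Convert retained activity (% of wild type) into a fold decrease."""
    if percent_of_wild_type <= 0:
        raise ValueError("retained activity must be positive")
    return 100.0 / percent_of_wild_type


def percent_retained(fold: float) -> float:
    """Convert a fold decrease into retained activity (% of wild type)."""
    if fold <= 0:
        raise ValueError("fold decrease must be positive")
    return 100.0 / fold


if __name__ == "__main__":
    # E146 mutants retain 0.2-1.6% of wild-type activity.
    print(fold_decrease(1.6), fold_decrease(0.2))
    # A 10,000-fold loss, as often seen when a general base is removed.
    print(percent_retained(10_000))
```

Retaining 1.6% and 0.2% of wild-type activity corresponds to roughly 60-fold and 500-fold decreases, matching the range quoted in the text. A 10,000-fold loss would leave only 0.01% of wild-type activity, which is why the E146 data fall well short of what is expected for a general base.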

Additional evidence that Glu146 is not the general base was obtained by
determining the pH dependence of activity of the E146Q mutant. If E146 were
responsible for the ascending acidic limb in the bell-shaped pH/activity
curve for the wild-type enzyme, then exchanging this residue for a
non-ionizable one should abolish this limb, and a flat line at acidic pH would
be observed. Because both the wild-type and E146Q mutants exhibit similar
pH/activity profiles at pH 5–8 (Fig. 13), a general base with a similar pKa is being
ionized in both cases [35]. The initial portion of both curves has a slope of one,
consistent with a single residue undergoing ionization.

Fig. 13. kcat/Km as a function of pH for wild-type recombinant PLCBc (shaded circles) and the
E146Q mutant (open circles) [35]
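
Why a slope of one points to a single ionizing group can be seen from the simplest model of the acidic limb, in which activity is proportional to the fraction of one group that is deprotonated. The Python sketch below illustrates this under stated assumptions: the function names are hypothetical, only the ascending limb is modeled (not the full bell shape), and the pKa of 5.4 is an illustrative value chosen from the 5.3–5.5 range quoted in Sect. 5.2.1.3.

```python
import math


def log_activity(pH: float, pKa: float, log_max: float = 0.0) -> float:
    """log10(kcat/Km) when activity requires one group to be deprotonated.

    Model: activity = max_activity / (1 + 10**(pKa - pH)).
    """
    return log_max - math.log10(1.0 + 10.0 ** (pKa - pH))


def profile_slope(pH: float, pKa: float, dpH: float = 0.01) -> float:
    """Numerical slope of the log(activity) versus pH plot at a given pH."""
    upper = log_activity(pH + dpH, pKa)
    lower = log_activity(pH - dpH, pKa)
    return (upper - lower) / (2.0 * dpH)


if __name__ == "__main__":
    pKa = 5.4  # illustrative value, see lead-in
    for pH in (3.4, 5.4, 7.4):
        print(pH, round(profile_slope(pH, pKa), 3))
```

Well below the pKa the slope approaches one, at the pKa it is one half, and well above the pKa the curve flattens. Two groups ionizing together on the acidic limb would instead give a limiting slope of two.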

Since it appears that the E146 side chain plays no role in activating the
nucleophilic water, it remained to establish the role of E146. As noted previously, the
crystallographic evidence also implicated E146 as a Zn2 ligand. Removal of
metal ligands in metalloenzymes often affects their structure and stability,
so the structures and zinc occupancies of the E146 mutants were examined.
Indeed, the circular dichroism (CD) spectra of the E146 mutants differed
markedly from the wild-type, and the active sites in each of these mutants were
occupied by only two zinc ions as measured by atomic absorption.

The weight of the available evidence therefore supports the role of Glu146 as
a Zn2 ligand. As the primary functions of Zn2 are presumably to organize the
active site and help neutralize the negative charge on the phospholipid substrate,
its removal is consistent with the observed loss of enzymatic activity in the E146
mutants. Similar catalytic changes have been documented in aminopeptidase A
[90], alkaline phosphatase [91], and carbonic anhydrase II [92], where mutations
of Zn ligands resulted in proteins with 0.1–5% of the activity of the wild type.



5.2.1.2

Glu4 as a General Base 

Inspection of Fig. 12 shows that Glu4 of PLCBc is also suitably positioned to serve
as a general base. One of the carboxylate oxygens of this residue is 2.7 Å from an
active site water (W2) that is located 4.7 Å from the phosphorus atom of the
inhibitor. Like W1, this water is opposite the DAG leaving group and in position
to attack the phosphorus atom via an associative transition state and an in-line,
SN2(P)-type mechanism [88]. However, the other carboxyl oxygen of Glu4 is 4.0 Å
from the methyl groups of the choline headgroup, suggesting that this residue
could also stabilize the positive charge of the trimethylammonium moiety.

The case for Glu4 as the general base was supported by one molecular
mechanics study using the program GRID, which is designed to calculate favorable
binding sites on macromolecules, to dock the phosphatidylcholine substrate
into the active site of PLCBc [93]. This calculation, which was performed before
the availability of the structure of the complex of PLCBc with the phosphonate
inhibitor, predicted that the phosphate oxygens would coordinate with all three
zinc ions, leaving the three acidic residues Glu4, Asp55, and Glu146 within the
6.5 Å of the phosphorus required for them to function as the general
base. Because earlier crystallographic data suggested Asp55 and Glu146 served
as zinc ligands, Glu4 seemed the rational choice as the general base.

There have been two independent mutagenesis studies directed toward
probing the role of E4 in the PLCBc reaction [36, 94]. In the first of
these, the kinetic parameters kcat and Km of the E4L, E4D, and E4Q mutants,
which each gave CD spectra similar to wild-type, were determined by the choline
quantitation method [33], and these mutants were found to retain 6–60% of
the catalytic efficiency (i.e., kcat/Km) of wild-type [36, 64]. Furthermore, the
pH/activity curve of the E4L mutant was virtually identical
with that for E146Q (Fig. 13) and similar to that of wild-type. In the other
investigation, the E4A mutant was observed to have greater phosphomonoesterase
activity than wild-type PLCBc [94].

Inasmuch as these experiments eliminated Glu4 as a contender for the general
base, the question of the function of Glu4 remains. All Glu4 mutants retain
~50% of the wild-type kcat, so the loss in catalytic efficiency is due primarily to
increases in Km. This observation, coupled with the relationship of the side chain
carboxyl group in the PLCBc-inhibitor complex, suggests that Glu4 is involved in
substrate binding and recognition. It is known that mutations of Glu4 affect the
processing of C6PC and C6PE but not C6PS (see Table 6), so it appears that Glu4
may play a role in dictating the substrate specificity of PLCBc.



5.2.1.3

Asp55 as a General Base

One of the carboxylate oxygens of the Asp55 side chain is 2.8 Å from an active
site water molecule (W2), while the other oxygen is 2.7 Å from Zn1 (Fig. 12). It is
perhaps significant that this Zn1–O distance is about 0.6 Å longer than other
Zn-ligand distances in this complex, and it is about 0.4 Å longer than the same
Zn1–O distance in the uncomplexed enzyme (see Figs. 6, 7b) [62]. The side
chains of Asp122, His118, and His69 are also ligands for Zn1, so the importance
of Asp55 as a ligand for Zn1 during catalysis is uncertain. Although Asp55 had
been suggested as a candidate for the general base [93], it was never actually
proposed to perform this role until a recent molecular mechanics and dynamics
study identified the position of the putative nucleophilic water that would be
activated by Asp55 [95].

In order to elucidate the function of Asp55 in the PLCBc-catalyzed reaction, the
site-specific mutants D55L, D55N, and D55E were isolated and purified [36].
Even the conservative replacement of Glu for Asp gave a protein with a 4000-fold
lower catalytic efficiency, whereas the D55N and D55L mutants exhibited
reductions in activity of 10⁴- to 10⁶-fold! The primary changes were in kcat, with Km
remaining essentially unperturbed. The structure of the D55L mutant was
essentially identical to wild-type PLCBc on the basis of comparisons of the CD,
temperature dependent CD, and atomic absorption spectra. Because of its low
catalytic activity, it has not been possible to determine the pH dependence of
activity for D55N. Thus, the preponderance of available evidence supports the
role of Asp55 as the general base in the PLCBc-catalyzed hydrolysis of
phospholipids.

If Asp55 is the general base, pH dependence curves suggest that it has a pKa
of 5.3–5.5 [96], which is substantially higher than the pKa of 3.9 that is measured
for aspartic acid in solution. However, specific protein environments are able
to alter significantly the observed pKas of amino acids. For example, Glu35 in
lysozyme has the unusually high pKa of ~6.2, a value that has been attributed to
hydrophobic and electrostatic effects [97, 98]. Similar effects on other enzymatic
carboxylate residues have been commonly recorded [99]. Examination of the
immediate environment of Asp55 in various crystal structures reveals several
factors that could contribute to its elevated pKa [45, 62, 65]. For example, Asp55
is proximal to the hydrophobic side chains of Phe66 and Ala3. Moreover, the
permanent dipole in α-helical structures has been found to raise the pKas of side
chains at the C-terminus of the helix, while the pKas of these side chains are
lowered when they are at the N-terminus [100, 101]. Inasmuch as Asp55 is located
at the extreme C-terminal end of α-helix B, one would expect an increase in
its pKa. The combination of these effects appears to override the proximity effect
of Zn1 that would tend to lower the pKa of Asp55.
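
To give a feeling for the size of the environmental effect discussed above, a pKa shift can be converted into a free energy through the standard thermodynamic relation ΔΔG = 2.303 RT ΔpKa. The Python sketch below is a back-of-the-envelope illustration and not a calculation from the source. The function name is hypothetical and a temperature of 298.15 K is assumed.

```python
import math

R_KCAL = 1.987204e-3  # gas constant in kcal mol^-1 K^-1


def pka_shift_free_energy(pka_in_protein: float,
                          pka_in_solution: float,
                          temperature_k: float = 298.15) -> float:
    """Free energy (kcal/mol) corresponding to a pKa shift.

    Uses ddG = ln(10) * R * T * (pKa_protein - pKa_solution). A positive
    value means the protein environment disfavors the ionized carboxylate
    relative to free solution.
    """
    delta_pka = pka_in_protein - pka_in_solution
    return math.log(10.0) * R_KCAL * temperature_k * delta_pka


if __name__ == "__main__":
    # Asp55: apparent pKa 5.3-5.5 versus 3.9 for aspartic acid in solution.
    for pka in (5.3, 5.5):
        print(pka, round(pka_shift_free_energy(pka, 3.9), 2))
```

Under these assumptions a shift of 1.4–1.6 pKa units corresponds to roughly 2 kcal/mol, a modest energy that hydrophobic contacts and a helix dipole could plausibly supply.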



5.2.1.4

Metal Ion Activation of Water 

With three zinc ions in the active site, it might seem reasonable that PLCBc would
utilize a zinc-bound hydroxide as the attacking nucleophile. Indeed, zinc-bound
hydroxides are routinely invoked as nucleophiles for zinc peptidases, amidases,
and zinc phosphoryl transfer enzymes [86]. For PLCBc, any one of the three zinc
ions could individually activate a water molecule, or a hydroxide bound to Zn1
and Zn3, which comprise the bi-nuclear site, could be the attacking nucleophile.
However, activation of water by an active site zinc ion seems irreconcilable with
the crystallographic data available for PLCBc and its complex with a phosphonate
inhibitor (see Fig. 7b). In this complex, the non-bridging oxygens on phosphorus
are tightly associated (2.1 Å) with the three zinc ions, and the bridging water
associated with Zn1 and Zn3 and both waters that were coordinated with Zn2 in
the native structure (see Fig. 6) have been displaced. Thus, in the
PLCBc-phosphonate complex, no zinc-bound water molecule that can be activated is visible.
This situation should perhaps be contrasted with the structure of the complex
of PLCBc with inorganic phosphate, wherein two of the zinc-bound waters have
been displaced, but one water remains coordinated to Zn2 (see Fig. 7a).

An enzyme inhibitor complex represents a static picture of an enzyme engaged
in a non-catalytic event, so care should be exercised in drawing conclusions
based upon such structures. Recognizing this caveat, a second theoretical study
of the PLCBc mechanism was conducted using the GRID program [102]. Modeling the
substrate into the active site led to the prediction that the phosphate moiety
would bind solely to Zn2 as observed in P1 nuclease (cf. Fig. Sb) [69]. This
mode of complexation then allows Zn1 and Zn3 to bind an active site water,
thereby facilitating its deprotonation and activation. This version of the
PLCBc reaction mechanism has been propagated in several recent reviews
[63, 86, 103], but there is no direct experimental evidence that bears on this
proposal. Moreover, use of a zinc-bound hydroxide as an attacking nucleophile
does not appear to be consistent with the observed normal isotope effect of 1.9
(see Sect. 5.3.2) [34].



5.2.2 

Protonation of Diacylglycerol Leaving Group

The nature of the reaction catalyzed by PLCBc, in which phosphatidylcholine is
split into diacylglycerol and phosphorylcholine (Fig. 11), requires two proton
transfer steps: the first is the deprotonation of an active site water to
generate the attacking hydroxide nucleophile, and the second is the protonation
of the alkoxide leaving group. Although analyses of the X-ray structures of
PLCBc and its complexes provided some initial insights about the general base,
they offered little information regarding the identity of the general acid that
must furnish the proton for the departing diacylglycerol. Despite this dearth
of crystallographic evidence, certain scenarios involving a zinc ion in concert
with bulk solvent, a zinc-bound water, or an amino acid side chain appear
feasible.



5.2.2.1

Zinc Ion or Zinc-Bound Water as General Acid 

Owing to the absence of any amino acid residues within 4 Å of the DAG-oxygen
in the crystal structure of the PLCBc-phosphonate inhibitor complex [45], the
likelihood that one of the zinc ions could participate as the general acid must
be considered, and there are several possibilities. In the first of these,
either Zn2, which is located 3.9 Å away from the DAG methylene in the inhibitor
complex, or Zn3, which is 2.9 Å from this methylene group, could coordinate
with the leaving oxygen of the diacylglycerol moiety. Inasmuch as Zn1 is
tetracoordinate in this complex, it perhaps seems a more likely candidate. The
transient Zn-O interaction would persist until the alkoxide was protonated,
presumably by bulk solvent, as no amino acids or active site waters seem
properly positioned. Such a role for a zinc ion is precedented in the alkaline
phosphatase reaction where a similar function has been assigned to a zinc ion
[67]. However, this reaction proceeds through a covalent enzyme-substrate
intermediate, and it is thus quite different from the PLCBc reaction.

The second way in which a zinc ion might be involved in facilitating departure
of the diacylglycerol group is that a zinc-bound water molecule could serve as
the general acid [104]; the pKa of a Zn-bound water is typically in the range
7-9 [105]. In this scenario, a proton on a zinc-bound water would be
transferred to the departing DAG alkoxide. Although the X-ray structure of the
complex of PLCBc with inorganic phosphate reveals a water molecule coordinated
with Zn2, there is no such water in the complex of the phosphonate inhibitor
with PLCBc (Fig. 7). The absence of any zinc-bound water molecule renders this
scenario somewhat problematic, but it is premature to draw any definite
conclusions, as structures of additional complexes of PLCBc are needed to
resolve this dilemma.



5.2.2.2

Amino Acid Side Chain as General Acid 

In the absence of compelling evidence supporting a zinc ion or a zinc-bound
water as the general acid, an amino acid side chain should still be considered
as a candidate for the general acid. The difficulty is selecting a suitable
active site residue. Consequent to a molecular mechanics simulation of the
PLCBc reaction, an intriguing proposal has recently been advanced in which
Asp55 serves both as the general base and as an assistant in the alkoxide
protonation (Fig. 14) [95]. In this mechanism, Asp55 first deprotonates an
active site water, and the resulting hydroxide attacks the phosphodiester
function (Step a). The DAG leaving group is then protonated by another water
molecule, and the resulting hydroxide removes the proton from Asp55, leaving
the active site ready for the next catalytic event (Step b).

Fig. 14. Proposed mechanism for PLCBc whereby Asp55 serves a dual role as
general acid/general base

Although this is an interesting proposal, there are some potential problems 
that argue against its likelihood. Analysis of the activity vs pH data
indicates that the pKa of the general acid would be approximately 10 [48, 96],
and such an elevated pKa would seem unlikely for an aspartic acid residue.
Moreover, the ability of a single amino acid residue to serve the roles of
general base and general acid is inconsistent with the bell-shaped plot of
activity of PLCBc vs pH.
Assuming that pH changes are not inducing unforeseen effects upon enzyme 
activity, such a bell-shaped curve indicates that the general base and the general 
acid are being titrated on the corresponding acidic and basic limbs. If Asp55 
performed in the proposed dual capacity, the pH dependence curve would have 
an ascending acidic limb without a descending basic limb, because the general 
acid would be continuously formed irrespective of pH. In light of this analysis 
and the experimental evidence that supports the role of Asp55 as the general 
base (see Sect. 5.2.1.3), it is kinetically impossible for Asp55 to serve as
the general acid.
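
The argument based on the shape of the pH profile can be made concrete with a
simple titration model. The Python sketch below is an illustration rather than
an analysis from the cited work. It assumes that activity is proportional to
the fraction of enzyme in which the general base is deprotonated and the
general acid is protonated, and it uses pKa values of about 5.4 and 10 taken
from the discussion above. The function names are hypothetical.

```python
def bell_shaped_activity(ph, pka_base=5.4, pka_acid=10.0):
    """Relative activity when two separate groups are titrated.

    The active form has the general base deprotonated (pH above pka_base)
    and the general acid protonated (pH below pka_acid).
    """
    return 1.0 / (1.0 + 10.0 ** (pka_base - ph) + 10.0 ** (ph - pka_acid))


def single_limb_activity(ph, pka_base=5.4):
    """Relative activity when only the general base is titrated.

    This is the profile expected if the general acid were regenerated
    irrespective of pH, as in the dual-role proposal for Asp55.
    """
    return 1.0 / (1.0 + 10.0 ** (pka_base - ph))


if __name__ == "__main__":
    print(" pH   two groups   one group")
    for ph in (4.0, 5.4, 7.7, 10.0, 12.0):
        print(f"{ph:4.1f}   {bell_shaped_activity(ph):10.3f}   "
              f"{single_limb_activity(ph):9.3f}")
```

In this model only the two-group case shows a descending basic limb, which is
the feature of the experimental curve that the dual-role proposal does not
reproduce.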

With the exception of Asp122 and His69, we have replaced all of the active
site residues by site-specific mutagenesis [64], and we have not yet observed
the expected thousand-fold or greater decrease in activity, unaccompanied by a
significant structural change, that would typically be associated with removal
of the general acid [106]. Because Asp122 bridges Zn1 and Zn3, it does not seem
a likely candidate for the general acid. In the PLCBc-phosphonate complex, the
NH of His69 is positioned about 5.2 Å from the methylene group that would
correspond to the diacylglycerol oxygen [45], so it might serve as the general
acid. The ability of histidine side chains to function as general acids has
been well documented, and the pKa of the N-H on a histidine that is coordinated
to a metal ion is about 11-12 [105], a value close to that estimated from pH vs
activity plots for the general acid in the PLCBc reaction. Thus, the
possibility that an amino acid side chain is the general acid has not yet been
rigorously excluded.



5.3 

Catalytic Cycle of PLCBc

The reactions of all hydrolytic enzymes proceed through the basic steps of sub- 
strate binding, hydrolysis, product release, and regeneration of the active form 
of the enzyme. This sequence of events is known as the catalytic cycle, and its 
elucidation is a critical element in understanding the function and mechanism 
of an enzyme. With respect to the PLCBc catalyzed hydrolysis of phospholipids,
there were two questions of interest: What is the rate-determining step (RDS), 
and what is the order of product release? The rate-determining step may involve 
substrate binding or product release, which are external steps, or the hydrolysis 
reaction itself, which is an internal step. Solvent viscosity experiments attest to 
whether the RDS is an internal or external step, whereas deuterium isotope 
effects indicate if a proton transfer is involved in the RDS. If a proton transfer 
occurs in the RDS, a proton inventory plot can give insights as to how many 
protons are in flight. Product inhibition studies indicate which product is the 
last to leave the enzyme. 



5.3.1 

Viscosity Effects 

If substrate binding or product release is rate-determining in an enzymatic reac- 
tion, then the reaction will be slower in media of increased viscosity. Conversely, 
if the RDS is hydrolysis, then changes in viscosity would not affect the reaction 
rate. Inasmuch as the catalytic efficiency of PLCBc (335 mM⁻¹ s⁻¹) [35] does not
approach the diffusion controlled limit of 10⁸-10⁹ M⁻¹ s⁻¹, substrate binding was
not expected to be rate-limiting; however, this hypothesis had to be verified. Thus, 
if a diffusive step were rate limiting, product release seemed the only reasonable 
possibility. Indeed, release of DAG from the active site had been suggested 
previously as the likely rate-determining step [49]. 

To determine the effect of microviscosity on the PLCBc catalyzed hydrolysis
of the soluble substrate 1,2-dihexanoyl-sn-glycero-3-phosphocholine (C6PC),
the kinetic parameters were obtained in solutions containing sucrose as the
viscogen. These experiments indicated that there was little change in reactivity
as the viscosity was increased (Fig. 15), and hence a diffusion controlled process
such as substrate binding or product release is not rate limiting in the PLCBc
catalyzed hydrolysis of C6PC [34].
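
The way such data are usually read can be sketched in a few lines of Python.
The relative rate in the absence and presence of viscogen is plotted against
the relative viscosity, and the slope of the resulting line is examined: a
slope near one points to a diffusive step, a slope near zero to a chemical
step. The numbers below are invented for illustration and are not the
measurements of [34]; the function name is likewise hypothetical.

```python
def viscosity_slope(rel_viscosity, rel_rate_ratio):
    """Least-squares slope of (k0/k) plotted against (eta/eta0)."""
    n = len(rel_viscosity)
    mean_x = sum(rel_viscosity) / n
    mean_y = sum(rel_rate_ratio) / n
    sxy = sum((x - mean_x) * (y - mean_y)
              for x, y in zip(rel_viscosity, rel_rate_ratio))
    sxx = sum((x - mean_x) ** 2 for x in rel_viscosity)
    return sxy / sxx


# Hypothetical relative viscosities (eta/eta0) of sucrose solutions.
ETA = [1.0, 1.5, 2.0, 2.5, 3.0]
# Hypothetical k0/k values for a fully diffusion-controlled reaction.
DIFFUSION_LIMITED = [1.0, 1.5, 2.0, 2.5, 3.0]
# Hypothetical k0/k values for a reaction limited by a chemical step.
CHEMISTRY_LIMITED = [1.0, 1.02, 0.99, 1.03, 1.01]

if __name__ == "__main__":
    print("diffusion-limited slope:", round(viscosity_slope(ETA, DIFFUSION_LIMITED), 3))
    print("chemistry-limited slope:", round(viscosity_slope(ETA, CHEMISTRY_LIMITED), 3))
```

A result resembling the second data set, in which the rate is essentially
insensitive to viscosity, is the behavior reported for the hydrolysis of C6PC.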



5.3.2 

Deuterium Isotope Effects 

The O-D bond has a lower zero point energy than an O-H bond, so it requires a
higher activation energy to reach the transition state (TS) for bond breaking.
Thus, if an O-H(D) bond is broken during the rate-determining step of a
reaction, there is a difference in the rates of the two reactions, the
magnitude of which varies with the nature of the TS. If the proton being
transferred is bound equally between two atoms in the TS, then the effect is
maximized, whereas if the TS is product- or reactant-like, primary isotope
effects closer to one will be observed.

Fig. 15. Effect of various concentrations of the microviscogenic agent sucrose
on the PLCBc catalyzed hydrolysis of C6PC. The dashed line has a slope of one
and represents data expected if an external or diffusive step were
rate-determining [34]
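
The zero point energy argument can be turned into a rough upper bound on the
isotope effect. The following Python sketch is a textbook-style estimate added
for illustration, not a calculation from the cited work. It assumes a harmonic
O-H stretch of 3400 cm⁻¹, integer atomic masses, a temperature of 298.15 K, and
complete loss of the stretching vibration in the transition state; all of
these are assumptions, as are the function names.

```python
import math

HC_OVER_K = 1.438777  # second radiation constant hc/k_B in cm K


def od_stretch(oh_wavenumber):
    """O-D stretching wavenumber from the O-H value via the reduced-mass ratio.

    Integer masses (O = 16, H = 1, D = 2) are used as an approximation.
    """
    mu_oh = 16.0 * 1.0 / (16.0 + 1.0)
    mu_od = 16.0 * 2.0 / (16.0 + 2.0)
    return oh_wavenumber * math.sqrt(mu_oh / mu_od)


def max_semiclassical_kie(oh_wavenumber=3400.0, temperature=298.15):
    """kH/kD if the whole zero point energy difference is lost in the TS."""
    delta = oh_wavenumber - od_stretch(oh_wavenumber)  # in cm^-1
    # The zero point energy is half of the vibrational quantum.
    return math.exp(0.5 * HC_OVER_K * delta / temperature)


if __name__ == "__main__":
    print(f"O-D stretch: {od_stretch(3400.0):.0f} cm^-1")
    print(f"maximum semi-classical kH/kD: {max_semiclassical_kie():.1f}")
```

Under these assumptions the limiting value is close to ten, so observed effects
well below this figure are compatible with a transition state in which the
proton is not symmetrically shared.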

Because solvent viscosity experiments indicated that the rate-determining step 
in the PLCBc reaction was likely to be a chemical one, deuterium isotope effects
were measured to probe whether proton transfer might be occurring in this step.
Toward this end, the kinetic parameters for the PLCBc catalyzed hydrolysis of the
soluble substrate C6PC were determined in D2O, and a normal primary deuterium
isotope effect of 1.9 was observed for the reaction [34]. A primary
isotope effect of magnitude of 1.9 is commonly seen in enzymatic reactions in
which proton transfer is rate-limiting, although effects of up to 4.0 have been 
recorded [107-110]. 

The observed normal isotope effect of 1.9 provides further evidence support- 
ing the role of Asp55 as the general base. Namely, a normal isotope effect of 1.9 
is most consistent with general base catalysis by an amino acid side chain, as 
inverse isotope effects are commonly observed when a zinc-bound water mole- 
cule, or hydroxide, is the attacking nucleophile. For example, the zinc-containing 
enzymes AMP deaminase [111], thermolysin [112], stromelysin [113], and a 
desuccinylase [114] are each believed to utilize a zinc-bound water as the 
nucleophile, and all of these reactions are characterized by an inverse deuterium 
isotope effect. This inverse isotope effect is thought to result from a dominant
equilibrium solvent isotope effect for proton transfer at the metal-bound water.
If a zinc-bound water were utilized by PLCBc, a normal isotope effect would be
expected only if the proton transfer step were not at equilibrium, a situation
markedly different from the four known cases. Indeed, we are unaware of any
enzyme that utilizes a zinc-bound water as the nucleophile and shows a normal
deuterium isotope effect. Thus, the observation of a normal isotope effect for the
PLCBc reaction is prima facie evidence that a zinc-bound water or hydroxide is
not the attacking nucleophile (see Sect. 5.2.1.4).



5.3.3 

Proton Inventory 

Having established that a proton transfer is involved in the rate-determining
step of the PLCBc catalyzed reaction, the question of how many protons were “in
flight” in this step remained. This issue is most easily addressed by a proton
inventory experiment in which the enzyme catalyzed reaction is conducted in
buffers containing different mole fractions of D2O [115]. The shape of the
resulting plot of activity vs mole fraction of D2O provides information
regarding the number of protons that are being transferred during the
rate-determining step of the reaction. When a straight line is obtained, the
activity decreases proportionally as the mole fraction of D2O is increased, thus
indicating that one proton is being transferred in the rate-determining step.
Thus, the proton inventory data, which are depicted in Fig. 16, are consistent
with the transfer of a single proton in the rate-determining step for the PLCBc
catalyzed hydrolysis of phospholipids [34].

Fig. 16. Proton inventory data for PLCBc [34] (horizontal axis: mole fraction
n of D2O)
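
The distinction between a linear and a curved proton inventory follows from
the Gross-Butler relation, which the Python sketch below evaluates. It is an
illustration added here and not the analysis performed in [34]. It assumes
that all reactant-state fractionation factors are unity and that, in the
two-proton case, both protons contribute equally to the overall isotope effect
of 1.9; the function name is hypothetical.

```python
import math


def rate_ratio(n, ts_fractionation_factors):
    """Gross-Butler ratio k_n/k_0 at mole fraction n of D2O.

    Each transferred proton contributes a factor (1 - n + n * phi), where phi
    is its transition-state fractionation factor. Reactant-state factors are
    assumed to be unity.
    """
    ratio = 1.0
    for phi in ts_fractionation_factors:
        ratio *= (1.0 - n + n * phi)
    return ratio


OVERALL_EFFECT = 1.9  # observed kH/kD quoted in the text
ONE_PROTON = [1.0 / OVERALL_EFFECT]
TWO_PROTONS = [math.sqrt(1.0 / OVERALL_EFFECT)] * 2  # equal contributions assumed

if __name__ == "__main__":
    print("  n    one proton   two protons")
    for n in (0.0, 0.25, 0.5, 0.75, 1.0):
        print(f"{n:4.2f}   {rate_ratio(n, ONE_PROTON):10.4f}   "
              f"{rate_ratio(n, TWO_PROTONS):11.4f}")
```

With one proton the plot is strictly linear, whereas with two protons it bows
below the straight line that joins the end points. The difference is small for
an overall effect of this size, which is why accurate rate data are needed.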



5.3.4 

Product Inhibition 

Diacylglycerol has long been known to be a weak competitive inhibitor of PLCBc,
whereas phosphorylcholine shows very little inhibition [40, 49, 116]. Recent
kinetic assays of PLCBc activity in the presence of DAG indicate that it is a
competitive inhibitor with a Ki of the order of 10 mM, whereas phosphorylcholine
was found to be an extremely weak (Ki = 30-50 mM), mixed inhibitor of PLCBc
[34]. Because diacylglycerol is a competitive inhibitor of the enzyme, the nature
of the catalytic cycle dictates that it must be the last product to leave the enzyme
active site.
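
The kinetic signature of competitive inhibition can be illustrated with the
standard rate law. In the Python sketch below, which is offered as an
illustration only, the inhibition constant of 10 mM is the order of magnitude
quoted above for DAG, whereas the values chosen for Km and Vmax are
hypothetical, as are the function names.

```python
def competitive_rate(s, i, vmax, km, ki):
    """Michaelis-Menten rate with a competitive inhibitor.

    The inhibitor raises the apparent Km by the factor (1 + i/ki) and leaves
    Vmax unchanged. All concentrations share one unit (mM here).
    """
    return vmax * s / (km * (1.0 + i / ki) + s)


def apparent_km(km, i, ki):
    """Apparent Michaelis constant in the presence of a competitive inhibitor."""
    return km * (1.0 + i / ki)


if __name__ == "__main__":
    VMAX = 1.0    # hypothetical, arbitrary units
    KM = 0.2      # hypothetical Km in mM
    KI_DAG = 10.0  # order of magnitude quoted in the text, in mM
    for dag in (0.0, 10.0, 20.0):
        print(f"[DAG] = {dag:4.1f} mM: apparent Km = {apparent_km(KM, dag, KI_DAG):.2f} mM, "
              f"rate at 100 mM substrate = {competitive_rate(100.0, dag, VMAX, KM, KI_DAG):.3f}")
```

Because the inhibitor and the substrate compete for the same enzyme form, the
inhibition vanishes at saturating substrate. This is the behavior expected of
the product that dissociates last and so regenerates the free enzyme.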



5.4 

Mechanistic Summary of the PLCBc Reaction

Although uncertainties remain, the available structural and experimental data 
now enable the proposal of a more comprehensive view of the mechanism and 
the catalytic cycle of the PLCBc catalyzed hydrolysis of phospholipids than has
heretofore been possible. The process commences when the phospholipid
substrate binds to the PLCBc active site (Fig. 17). The resulting enzyme-substrate
complex is stabilized by interactions between: (1) the non-bridging oxygens of
the phosphate group and the three zinc ions; (2) the positively charged choline
moiety and the amino acid residues at Glu4, Tyr56, and Phe66; (3) the sn-2 ester
carbonyl group and the NH of Asn134; and perhaps (4) hydrophobic interactions
between the sn-2 acyl side chain and Leu135, Phe66, and Phe70 [45, 65].
Due to the fixed geometry of the phosphate moiety, Zn1 and Zn3 move about
0.2 Å closer to one another upon substrate binding, a reorganization that is
accompanied by an increase of about 0.4 Å in the distance between the side
chain carboxyl group of Asp55 and Zn1 (see Figs. 6, 7b). It is perhaps this
reduced interaction with Zn1 that now enables Asp55 to act as the general base.

The catalytic reaction is then initiated by the deprotonation of an active site 
water by the general base Asp55, and the resulting hydroxide attacks the phos- 
phorus of the phosphodiester group. In this step, the zinc ions stabilize the nega- 
tive charge on the phosphodiester moiety, thereby rendering it more susceptible 
to nucleophilic attack. The position of the water in the active site relative to the 
phosphorus atom of the phosphodiester moiety is consistent with an in-line 
attack via an associative mechanism [86, 88], but there is no experimental
evidence to support this hypothesis. The details of how the diacylglycerol leaving
group acquires a proton are unknown, but the possibilities include: (1) transient 
stabilization of the DAG alkoxide by Zn2 followed by protonation by bulk water; 
(2) a zinc-bound water; and (3) an amino acid side chain. Based upon the X-ray 
structural data of PLCBc complexes, the former two possibilities appear more
likely, but additional structural and experimental data are required before this
important issue can be resolved. Solvent viscosity experiments coupled with the
observed normal deuterium isotope effect indicate that an internal, proton
transfer step is the rate-determining step in the reaction. The two steps in the
mechanism that involve proton transfer are the deprotonation of the active site
water by the general base and the protonation of the DAG leaving group by the
general acid, but it is not presently possible to determine which of these is the
RDS. Product inhibition experiments indicate that DAG is a competitive
inhibitor, necessitating that it is the last product released by the enzyme;
phosphorylcholine is therefore the first to be released.

Fig. 17. Catalytic cycle for PLCBc. After substrate binding, hydrolysis occurs
in the rate-determining step, followed by the sequential release of
phosphorylcholine and diacylglycerol. Amino acids known to be involved in
substrate binding are shown, and zinc ions appear as filled circles

6 

Perspectives 

While much is now known about the function, structure, and mechanism of 
PLCBc, there remain numerous unanswered questions. For example, it is well
documented that PLCBc preferentially hydrolyzes micellar substrates and
monomeric substrates with longer acyl side chains. However, the basis for the 
discontinuity in the rate of hydrolysis that is observed at the critical micelle 
concentration of the substrate is unknown as is the reason for the increase in the 
for soluble substrates as the length of the acyl side chains decreases. What is
the structural basis for the observed substrate specificity (i.e., PC > PE > PS) of
PLCBc, and what roles do the three amino acid residues Glu4, Tyr56, and Phe66,
which form the choline binding pocket, play in substrate recognition, binding, 
and catalysis? What are the energetics of the stabilizing interactions between 
amino acids in the choline binding pocket and the headgroups of different phos- 
pholipid substrates? Mechanistic questions also remain unresolved. Is deprotona- 
tion of the attacking water molecule or protonation of the departing diacyl- 
glycerol alkoxide the rate-determining, proton transfer step? What is the general 
acid? Inhibitors may often be utilized to gain insights into many of these and 
other questions, but better inhibitors must still be identified. Indeed, potent 
inhibitors might be utilized to isolate a mammalian PC-PLC and enable a more 
detailed study of these enzymes whose existence relies to date only on circum- 
stantial evidence. These and related questions will serve as the basis for future 
investigations in the exciting area of the enzymes of the PLC family.

Biological signal transduction pathways control the essential functions in all cells and tissues.
Understanding the mechanisms at the molecular level accelerates the development of new 
selective medicines. The determination of the disease related molecular targets is also essen- 
tial in these efforts. Photoaffinity labeling provides a well-established technique for identi- 
fication, localization of binding proteins, and studying intracellular protein-protein interac- 
tions. The application of photoaffinity labeling in biological signal transduction is discussed 
and illustrated with a rich collection of specific examples.

CD2          ‘complementary determining’
CK2          Protein kinase casein kinase
COX          cyclooxygenase
CRF          Corticotropin-releasing factor
CsA          Cyclosporin A
DAG          diacylglycerol
DZR          diazirines
EGFR         epidermal growth factor receptor
FKBP         FK506 binding protein
FLAP         5-lipoxygenase activating protein
FP           N-formyl-peptide
GDP          guanosine diphosphate
GPCR         G-protein coupled receptor
GTP          guanosine triphosphate
H4Bip        tetrahydro-L-biopterin
IL           interleukin
IP3          D-myo-inositol-1,4,5-trisphosphate
LT           leukotriene
MAP-kinase   mitogen activated protein kinase
MHC          major histocompatibility complex
NAD          nicotinamide adenine dinucleotide phosphate
NFκB         nuclear transcription factor
NHS          N-hydroxy succinimide
NOS          nitric oxide synthase
NPY          Neuropeptide Y
NSAID        non-steroid anti-inflammatory drugs
PAL          photoaffinity labeling
PGI2         prostacyclin
PH           pleckstrin homology
PI           phosphatidyl inositol
PIP2         phosphatidyl-inositol 4,5-bisphosphate
PK           Protein kinase
PL           phospholipase
PTH          Parathyroid hormone
PTK          protein tyrosine kinase
Ras          ras gene encoded proteins
SH2          sarcoma homology
SP           Substance P
TMDPhe       trifluoromethyl-diazirine-phenylalanine
VP           Vasopressin receptor



1 

Introduction 

Cells in multicellular organisms communicate with each other in order to
harmonize their individual actions. These signal transduction pathways control
several essential functions in all cells or tissues. In photoaffinity labeling (PAL)
[1-9] a new covalent linkage is created between a light-sensitive, detectable
ligand and the target biopolymer upon irradiation in a reversibly bound state
(Fig. 1). As a result of the photoinitiated covalent coupling reaction, the
functional biopolymer (receptor protein or enzyme) undergoes irreversible
activation or inactivation, and at the same time the occupied functional site is
labeled with a detectable tag which allows easy determination of its location.
Numerous ligand-protein or protein-protein interactions are involved in signal
transduction, and PAL represents a remarkably simple and efficient method for
studying those interactions in many aspects.

Fig. 1 A-C. General feature of photoaffinity labeling

2 

Biological Signal Transduction: Basics and Opportunities 
for Photoaffinity Labeling Studies 



2.1 

The Diversity of Signaling Pathways 

The major intracellular signaling pathways [10-13] are shown in the radically 
simplified Scheme 1. The simplest way of signaling utilizes lipophilic substances 
like steroid hormones which directly pass through the cell membrane and induce 
a response inside the cell without the assistance of second messenger molecules. 
The other fairly simple way of delivering a message into the cell utilizes ions 
as direct messengers. Cations or anions use different membrane-bound ion- 
channels that open up after binding of an agonist molecule on a specific site 
of the channel. On the other hand, the majority of the extracellular chemical
messengers are fairly big and hydrophilic, which prevents direct crossing of
the membrane, and consequently other mechanisms have evolved. Communication
through the cell membrane utilizes either membrane embedded receptor-enzymes
or membrane-coupled receptors, which undergo a conformational change upon
binding of the chemical messenger. That alteration initiates a series of
intracellular events including enzyme activation, protein-protein interaction
and translocation, or release of second messengers.

One important class of these systems is the membrane embedded enzyme-receptors
such as tyrosine kinases. The binding of the extracellular agonists
to the receptor-enzyme modulates the activity of the biocatalyst, causing
phosphorylation of the tyrosine residues on certain target proteins within
the cell. This type of signal transduction is used by growth hormones,
insulin, etc.

Scheme 1. A simplified overview of the intracellular signaling pathways

Membrane-associated receptors are linked to transducing proteins (like
G-proteins) in the inner portion of the membrane. G-protein coupled receptor
(GPCR) families comprise a major class of the pharmacologically relevant
receptors, such as muscarinic acetylcholine receptors, adrenoceptors,
dopamine receptors, serotonin, opiate, peptide hormone, and purinergic receptors,
and also sensory chemoreceptors. A large variety of subtypes are described in
the pharmacological literature.

Many of those receptors have already been cloned, which revealed a consistent
pattern (40%) with conserved homologies. The majority of GPCRs consist of a
single polypeptide chain of 400-500 amino acid residues, and all possess seven
transmembrane α-helices. A highly conserved region, the long cytoplasmic loop,
enables the receptor to couple to a heterotrimeric guanine-nucleotide-binding
transducer protein (G-protein). This protein mediates the signaling from the
cell surface receptor via conformational changes upon binding to a variety of
intracellular effectors.

G-proteins consist of three characteristic subunits: α-subunit (36-52 kDa),
β-subunit (35-36 kDa), and γ-subunit (8-10 kDa). The β- and γ-subunits form
a strong dimer, while the α-subunit is able to bind guanine nucleotides. Upon
agonist activation GDP is replaced by GTP, which induces the disintegration of
the G-protein trimer, liberating the α-subunit. As a result, it interacts with effector
systems, which can be either enzyme controlled or simply an ion-channel. The
intrinsic GTPase activity of the α-subunit slowly converts GTP back to GDP,
completing a signaling cycle.

In studying G-proteins, several tools have been developed. For example, certain
toxins are able to modify the G-proteins covalently, while others inhibit the
GTPase activity, which allows permanent activation. PAL can offer the same
possibilities, since photolabile agonists or GTP analogues covalently modify the
receptor or G-proteins.

The γ-subunit contains a characteristic CAAX motif, which allows covalent
attachment of prenyl groups and anchors the protein to the inner surface of the
membrane.

Independently, the β- and γ-subunits have an accompanying signaling function.
As recently shown, they participate in the activation of mitogen activated kinases
(MAP-kinases).

Much knowledge has been gathered about the signaling mechanisms of
G-proteins to date, but many questions remain unanswered and new areas
of investigation have opened. PAL is a particularly valuable tool for studying 
intracellular protein-protein interactions and the diverse function of second 
messengers. 

Two major second messenger systems have evolved. When the α-subunit
is temporarily linked to adenylate cyclase, the enzyme is activated and
catalyzes the transformation of ATP to cAMP. This second messenger activates a
specific protein kinase, which in turn phosphorylates target proteins in the cell,
leading to the overall physiological response. Such a pathway is utilized by the
β-adrenoceptor, dopaminergic, and prostaglandin receptors, etc.

In an alternative system the ultimate cellular response is the increase of the
intracellular Ca²⁺ level. A minor phospholipid (PI, phosphatidyl inositol) is
sequentially phosphorylated to PIP2 (phosphatidyl-inositol 4,5-bisphosphate),
which serves as a substrate for the effector enzyme (phospholipase C, PLCγ)
regulated by cell-surface receptors via the G-protein α-subunit. PLCγ cleaves PIP2,
yielding the water soluble D-myo-inositol-1,4,5-trisphosphate (IP3) and the lipid
soluble diacylglycerol (DAG). IP3 is a major Ca²⁺-releasing second messenger,
which activates a Ca²⁺ channel in the endoplasmic reticulum. The other second
messenger, DAG, activates a specific protein kinase, PKC, promoting
phosphorylation of target proteins in the cell.

In summary, it is characteristic of the GPCR signaling system that the
ligand-binding biopolymer and the effector enzyme are separate proteins and the
enzyme is activated with the assistance of a receptor coupled guanylate binding
protein.

Conversely, in the tyrosine kinase pathway the effector enzyme is either an
intracellular portion of the cell-surface receptor or directly coupled to it without
the assistance of a transducer protein.



2.2

Signal Transmission from Seconds to Hours 

Interestingly, the signaling speed is relatively fast in the G-protein coupled 
systems. The signal is transmitted in seconds from the surface receptor to the 
destination inducing the cellular changes; however, the simpler direct phospho- 
rylation pathway requires minutes to get changes in the cellular properties. In 
the tyrosine kinase pathway the chemical messenger binds to the large
extracellular domain, which is connected via a single α-helix to the intracellular
catalytic region. Upon binding the receptor dimerizes, followed by
autophosphorylation of its tyrosine residues. The emerging phosphotyrosine residues act as
acceptor sites for a unique peptide sequence (SH2 domain) of a distinct protein 
family. That binding initiates activation of various cell control functions through 
enzyme or transcription factor activation. 

Another receptor-enzyme system was elucidated, which forms a guanylate 
cyclase. This enzyme converts GTP to cGMP upon receptor binding on the outer 
surface. The resulting cGMP activates either a protein kinase or a phospho- 
diesterase. 

The previously discussed simplified signaling systems all point in one direc- 
tion [10]. Beside several cell controlling functions they initiate gene transcrip- 
tion/expression. That general phenomenon allows detection of the binding to 
the cell surface receptors at genetic level, establishing pharmacogenetics. On the 
other hand, this new approach enables easy identification of any gene-product 
(signaling proteins and enzymes) involved in the signaling cascade and allows 
one to study the pathways as a whole. 

Finally, steroids and thyroid hormones interact directly with the ultimate 
target, and stimulate transcription factors and gene expression. These lipophilic 
ligands readily cross the membrane and bind to nuclear proteins, which consist 
of a highly conserved DNA binding domain, and recognize a particular base- 
sequence. Upon agonist binding they activate the corresponding gene leading to 
increased protein synthesis. These events frequently require hours. 

In summary, the duration of the different signaling pathways can range from 
seconds to hours. Because irreversible activation and inactivation can be
controlled by photocovalent attachment, these versatile tools are very useful in
studying the signaling pathways, as noted. Traditional photoexcitation allowed only
the investigation of processes taking place within several minutes to hours. Now new
laser pulse techniques [14] allow the establishing of photocrosslinking within 
nanoseconds, opening up brand new perspectives for studying very fast signal- 
ing cascades. Laser flash photolysis has already been widely used in initiating 
signals in several biological systems using the inverse technology to PAL. In that 
methodology a photoreactive protecting group masks the pharmacophore in 
agonist or messenger molecules, causing a loss of biological activity (since the 
biologically active species are trapped in an inactive state they are frequently 
referred to as ‘caged’ compounds). Upon irradiation, the bond between the 
protecting group and the ligand is cleaved and the biologically active species is 
recovered within a very short time, allowing one to detect the fast kinetic conse- 
quences of the signal initiation [8, 15, 16]. PAL and photorelease of bioactive 
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compounds from “caged” compounds can be combined in many creative ways 
and their potential has not been fully exploited. One recent report demonstrates 
it. One of the photophores creates a cross-link between the ligand and the 
receptor, while activation of the other photoreacting group under different 
condition enables one to remove the ligand with the label from the protein- 
fragment after separation and digestion. Finally, the released peptide fragment 
can be identified with MS/MS sequence analysis [17]. 



2.3 

Identification of New Drug Targets in the Signaling Pathways: 

Another Important Role for Photoaffinity Labeling 

Several disease states are directly linked to the malfunction of the receptor or 
the signaling system. Identifying the most specific disease-related targets in the 
signaling machinery and gaining information about the binding domain are 
very important for selective drug design. 

Since these proteins are difficult to crystallize and site-directed mutagenesis 
has certain limitations, PAL can find much use in exploring pathways, trans- 
ducers, and effectors in signal transduction as well as some of their main struc- 
tural features. 

The potential of photoprobes is increasingly utilized as a remotely controll- 
able tool in the drug discovery process. The difference between the dark state 
stability and light sensitivity allows the separation of two actions: 

1. Introduction of a photoreactive group to biologically active molecules and 
setting all the parameters for the experiment. 

2. Initiation of a specific photochemical reaction at will by remote irradiation,
activating a photoreactive structural unit (photophore), which results in the
formation of a new stable covalent linkage.

PAL has been known for 30 years, but a new era of applications has appeared 
recently through the evolution of more efficient photophores and activation, as 
well as new high-resolution separation and detection techniques. The photocovalent
modification of the binding site, which has an irreversible effect on
activity and places a (radio)label, enables a multilevel analysis and identification
of the biological target.



2.4 

Different Levels of Protein Identification in PAL 

The result of a standard photoaffinity experiment can provide an output in 
three levels [8] as shown in Scheme 2. The photolytically induced targeted 
introduction of a (radio)label onto the target biopolymer helps to identify 
any specific binding proteins (first level of identification) responsible for a 
particular action in the signaling cascade or to identify the binding domain 
within the target protein (second level of identification) or allows one to 
identify the amino acid sequence of the binding protein (third level of identifica- 
tion). 
Scheme 2. The three levels of identification of biopolymers provided by photoaffinity labeling



The design of the photoprobe is based on structure-activity relationship 
(SAR) studies if available. Ideally the photoprobe should be bioactive over the 
same range as its parent compound. The next step is to synthesize the bioactive 
photoprobe in radiolabeled form. Similarly, non-radioactive labels (primarily 
biotin) can also be attached via a linker arm [18]. 

In the classical photoaffinity experiment a tissue homogenate is solubilized 
by standard detergents and incubated with the radiolabeled photoprobe for 
non-covalent binding prior to irradiation. By illumination at the excitation
wavelength a covalent linkage is established between the ligand and receptor 
around the binding site as shown in Scheme 2. 

After maximum radioactivity incorporation the protein is denatured and 
generally subjected to HPLC or gel-electrophoresis. Those methods separate the 
proteins from the specific tissue by size and the radioactivity distribution can be 
determined among the protein components. The specifically labeled biopolymers
are distinguished simply by a competition experiment, performed by
adding an excess of the unlabeled parent ligand, which eliminates the specific
radioactivity incorporation.
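
The competition criterion can be illustrated with a small calculation. The
following Python sketch is only an illustration under stated assumptions: the
band masses, the radioactivity counts, the Band class, and the 50% threshold are
hypothetical and are not taken from any of the cited studies. It treats the
labeling of a band as specific when most of its radioactivity is displaced by an
excess of the unlabeled parent ligand.

```python
from dataclasses import dataclass


@dataclass
class Band:
    """One gel band; the counts are hypothetical radioactivity readings (cpm)."""
    mass_kda: float
    total_cpm: float      # photoprobe alone
    competed_cpm: float   # photoprobe plus excess unlabeled parent ligand


def specific_fraction(band: Band) -> float:
    """Fraction of the incorporated label that the competitor displaces."""
    if band.total_cpm <= 0:
        return 0.0
    displaced = max(band.total_cpm - band.competed_cpm, 0.0)
    return displaced / band.total_cpm


def specifically_labeled(bands, threshold=0.5):
    """Return the bands whose labeling is mostly competable.

    The threshold is an arbitrary illustrative cut-off, not a literature value.
    """
    return [b for b in bands if specific_fraction(b) >= threshold]


# Hypothetical readings for two bands of a labeled homogenate.
bands = [
    Band(mass_kda=45.0, total_cpm=12000.0, competed_cpm=1500.0),
    Band(mass_kda=66.0, total_cpm=3000.0, competed_cpm=2900.0),
]

for b in specifically_labeled(bands):
    print(f"{b.mass_kda:.0f} kDa band: {specific_fraction(b):.1%} competable")
```

In practice the cut-off would be chosen from replicate experiments and the
background of the detection method; the sketch only makes the logic of the
comparison explicit.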

The specifically labeled proteins are isolated and subjected to limited proteolytic
digestion or chemical fragmentation. The fragments with the highest (radio)activity
are either localized by immunological methods or by comparison with known
fragmentation patterns (analyzed by MALDI MS or MS/MS), or subjected to amino
acid sequencing. The result is frequently subjected to homology search with 
known sequences in order to identify conserved regions and the relationship of 
the newly discovered biopolymer to existing protein families. The identified 
short sequence could also promote finding the right cDNA, which allows genetic 
recombination of the complete receptor. 

3 

Practical Overview of the Current Photoaffinity Techniques 



3.1 

Summary of the Most Frequent Photophores 

There are several photoreactive groups and structural units (“photophores”) 
used in PAL [3, 4, 6, 8]. The main criteria for the selection are: 

1. The photoreactive groups should have sufficient stability under ambient 
light. 

2. The life-time of the photochemically generated excited state should be shorter
than the dissociation of the ligand-receptor complex, but long enough to 
spend sufficient time in a close proximity to a target site for covalent linkage. 

3. The photophore should have an unambiguous photochemistry to provide a 
single covalent adduct. 

4. The excited state or radical should react preferably with unreactive C-H 
groups yielding a uniform and stable covalent label rather than interact with 
nucleophiles including solvents of that type. 

5. The activation wavelength of the photophore should be longer than that of
polypeptide UV absorption (>300 nm).

Several comparative studies are available in the literature, which clearly show
that, according to the latest results, tetrafluorophenyl azides (AZ),
trifluoromethylphenyl diazirines (DZR), and particularly benzophenone (BP) are the
best choices (Scheme 3). Diazocarbonyl compounds, which played a historical
role in the evolution of PAL, can also be considered together with aryldiazonium
salts [6] (not shown). For aryl azides the more common unsubstituted photophore
is presented. It should be noted that tetrafluorophenyl azides have an
increased reactivity towards C-H groups and they do not rearrange. Other
substituted phenyl azides can readily alter the excitation wavelength, allowing the
application of milder or differential photoactivation.

Three of the four photophores, listed in Scheme 3, eliminate nitrogen upon 
excitation while the biradicaloid triplet state of benzophenone can be reversibly 
activated, and creates a covalent linkage upon excitation-relaxation cycling. This
fact and the extremely low reactivity of benzophenones towards protic solvents make
them very efficient in the majority of cases [7, 19, 20]. Unsaturated ketones can be
activated by a similar mechanism, although the secondary processes are much 
more complex [21]. That mechanism is mainly utilized in steroid hormones 
possessing unsaturated ketones as intrinsic photophores. 
Benzophenone photochemistry

On the other hand, aryl azides can be introduced without increasing the
bulkiness of the ligand if the aryl substructure is already present, and this 
photophore also allows fast excitation and efficient radiolabeling, maintaining 
the popularity of that group among the protein target hunters. 
3.2

Main Principles for the Synthesis of Photoaffinity Probes 

While most of the synthetic strategy and design is equivalent to that of
standard analogue synthesis, one unique feature must be taken into
account: the relative photosensitivity of the probes. The introduction of
radiolabels or other types of labels can also modify the reaction sequence.

The following principles can be determined: 

1. During the chemical synthesis the photophore should remain intact, and 
therefore in many cases a precursor is constructed early in the synthetic 
steps, and in the final step that precursor structural unit is converted into the 
photolabile moiety (linear approach). 

2. Most practically, the photoreactive group or its precursor can be directly 
attached to the parent biologically active ligand at a suitable functionality or 
complementing a structural unit already present (semisynthetic approach). 
Two examples are direct benzoylation of an aromatic ring, forming a benzophenone,
and attachment of azidosalicylic acid to hydroxyl groups. In
that approach the photophore is generally designed to be part of the pharmacophore
(endo-photoaffinity ligands), which allows more accurate identification
of the binding site.

3. In the absence of a suitable functionality for direct attachment, a small linker
sidearm can be introduced, which contains an appropriate functionality at the
terminal position.

Practically, a terminally protected linker can be built at any stage of the
synthesis, allowing the attachment of the photophore group in the final step,
frequently in a radiolabeled form (tethered ligand approach). A typical
example, an aminoalkyl-tethered inositol-1,4,5-trisphosphate, is shown in
Fig. 2, indicating the maximum span (15 Å) between the phosphate residue
and the photoactivation site [22, 23].

Fig. 2. A typical construction of a tethered photoaffinity probe (labeled parts:
ligand, tether, radiolabel, photophore, reporter group)

Relatively large numbers of photoreactive, heterobifunctional reagents are 
available commercially, and simple precursors or photophore synthons can 
also be utilized (the most commonly used heterobifunctional cross-linker 
reagents are listed in Fig. 4). The tethered ligand approach not only saves time 
and effort but, by using a flexible linker with optimal length, the probability 
of the photochemical attachment can be increased. On the other hand it may 
cause multiple amino acid attachment around the bound ligand within 
the binding site. In that approach the photophore, connected via a linker of 
defined length, is generally placed far from the pharmacophore; therefore it
does not interfere with the binding but creates covalent linkages at a defined 
distance (exo-photoaffinity ligands). 

4. Another important principle is related to the synthetic efficiency. The ligand 
derivatization and tethering should be achieved with the minimal modifica- 
tion of the total synthesis of the parent ligand. 

5. If de novo synthesis is necessary, the main principle is to use a strategy that
allows one to synthesize a series of photoaffinity ligands with the photophore
in multiple positions around the pharmacophore. Using that approach the
investigators can test multiple photoprobes, which gives a better chance
of finding biologically active species and of definitively mapping the binding site.

The most frequent synthetic approaches towards the primary photophores are
summarized in Scheme 4. The preparation of aryl azide derivatives follows the
typical retrosynthetic pathway in the majority of the reported cases (Scheme 4A),
and, in practice, diazotization is the most commonly used procedure [24-29]. In the
case of diazirines only one major synthetic sequence is repeated: ammonolysis of
oximes followed by dehydrogenation (Scheme 4B) [30-32]. There are various
ways of preparing diazo or diazocarbonyl compounds; most frequently the
Forster and Bamford-Stevens reactions (Scheme 4C) are employed [33-37].

Benzophenones are usually attached as a complete photophore. Apart from 
the standard chemical techniques, new C-C coupling procedures extend the syn- 
thetic repertoire. However, in some cases direct benzoylation (e.g., Friedel- 
Crafts acylation) of aromatic or heteroaromatic rings can provide an easy access 
to BP or BP-like photophores (Scheme 4D) [38, 39]. 



3.3 

Photoreactive Amino Acids and Heterobifunctional Cross-Linkers 

In chemical communication many peptide-protein (e.g., peptide hormone
agonist) or protein-protein signaling interactions take place. Various photoreactive
amino acids, e.g., azidophenylalanines (Apa, TFApa) [40, 41], benzoylphenylalanines
(Bpa, p-OH-Bpa) [41, 42], and trifluoromethyl-diazirine-phenylalanine
(TMDPhe) [43], were developed (Fig. 3), which can be incorporated at any
position in peptide sequences by standard solid-phase synthetic techniques.
Scheme 4A-D. Selected synthetic approaches to the major “photophores”



A systematic replacement of each amino acid in the sequence with photoreactive
analogues allows a photoaffinity ‘scanning’ of the binding interface. Since
solid-phase synthesis is limited in the length of the peptide, Schultz et al. devel- 
oped a sophisticated method which makes it possible to incorporate unnatural 
amino acids into large peptide sequences. The photoreactive amino acid was 
linked to transfer RNA, which inserted the amino acid into the required position 
by in vivo translation [44]. 
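
The scanning idea can be made concrete with a short enumeration. The Python
sketch below is an illustration only: the function name photoaffinity_scan and
the five-residue example peptide are hypothetical, and Bpa is used simply because
it is one of the photoreactive amino acids named above. It lists one analogue per
position, each carrying a single photoreactive substitution.

```python
def photoaffinity_scan(residues, photo_residue="Bpa", skip=()):
    """Return (position, replaced residue, analogue) for every single substitution.

    residues      -- list of three-letter residue codes, N- to C-terminus
    photo_residue -- code of the photoreactive amino acid to introduce
    skip          -- residue codes that should be left untouched
    """
    analogues = []
    for index, original in enumerate(residues):
        if original in skip:
            continue
        variant = list(residues)
        variant[index] = photo_residue
        analogues.append((index + 1, original, "-".join(variant)))
    return analogues


# Hypothetical example peptide, not a sequence from the cited work.
peptide = ["Ala", "Gly", "Leu", "Phe", "Lys"]

for position, original, analogue in photoaffinity_scan(peptide, skip=("Gly",)):
    print(f"{original}-{position} -> Bpa: {analogue}")
```

The skip argument reflects the common practice of leaving residues known to be
essential for activity unmodified; which residues these are has to come from
SAR data and is not something the sketch can decide.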

The major requirements for photoactivatable amino acids are that they
should exhibit high optical purity and contain appropriate protecting groups for
solid-phase synthesis; there is also a particular advantage if they contain
the radiolabel. While p-azido-phenylalanines can easily be labeled with ¹²⁵I and
tritium [40, 41], Bpa needed a multi-step synthesis to incorporate tritium [45].
More recently Maggio et al. reported the synthesis of p-OH-Bpa, which allows
efficient radioiodination adjacent to the hydroxyl group [46].

Fig. 3. Frequently used photoactivatable amino acids

Alternatively, heterobifunctional cross-linker reagents [47] can be attached to
specific amino acid functionalities in peptide sequences (e.g., to lysine ε-amino
groups or cysteine mercapto groups). These reagents contain a
functional-group-specific anchoring unit and a photoreactive reporter group with a
radiolabel. Some frequently used heterobifunctional cross-linkers are shown in
Fig. 4, such as the amino-group-specific NHS esters, the iodinatable ASA, and
tritiated [³H]-BZDC [48]. Thiol-specific cross-linkers, such as MBP [20] and the
cleavable ABAPS [49], are attached to cysteine residues.

The application of heterobifunctional cross-linkers allows macromolecular
PAL to probe protein-protein interactions, including subunit interactions and
location, and to monitor the conformational changes induced by signal transmission.




Fig. 4. Frequently used photoactivatable bifunctional cross-linker reagents 



3.4 

Detection/Labeling Possibilities for Cross-Linked Sites 

The primary feature of the photoaffinity technique is that it not only creates a
covalent cross-link between the ligand and the binding protein, causing permanent
activation or inactivation, but at the same time places a detectable tag at the
functional domain, which allows easy localization. Although non-radioactive
labeling techniques are developing rapidly, the majority of recent photoprobes
still utilize radioactive detection [50]. There are two different ways of labeling:
in one approach the tag is joined directly to the photophore; alternatively, it
is placed somewhere else in the photoprobe. In the first approach the label is,
in practice, placed in a precursor or heterobifunctional reagent containing
both the label and the photophore. The second method is commonly used in
short peptide photoaffinity ligands (tyrosine residues are good targets for
radioiodination).

The most common radionuclides are ³H and ¹²⁵I, and in oligonucleotide probes ³²P.
The highest specific activity can be achieved with ¹²⁵I; on the other hand,
γ-irradiation requires more safety precautions. The ease of incorporation into
aryl azides (e.g., azidosalicylic acid [ASA] derivatives) and tyrosine residues by
Chloramine T/Na¹²⁵I makes it particularly popular in spite of the obvious
disadvantages. In a new coupling technique organotin derivatives are replaced
with iodine with high efficiency. Tritium offers a relatively safe alternative,
although high-specific-activity incorporation frequently needs a well-designed
multi-step synthesis and selective tritiation. Another problem with tritium
is that in certain positions it is easily exchanged for hydrogen. Halogen-tritium
exchange, catalytic tritiation, or reduction with NaB³H₄ are the
choices for its introduction.

Fig. 5. A biotinylated heterobifunctional cross-linker reagent

Non-radioactive labeling utilizes fluorescence, chemiluminescence, or
biotin/avidin interactions. Capillary electrophoresis with laser-induced fluor- 
escence was first employed in PAL by Miller et al. [51]. Gilbert and Rando 
recently reported several biotin-containing heterobifunctional reagents and 
used them successfully [18] (Fig. 5). 

Given the rapid development and increasing sensitivity of mass spectrometric
methods, it can be foreseen that labeling will more frequently become
dispensable in the near future. The cross-linked peptide can be detected first
with immunological methods, and the digested and cleaved fragments can then be
identified with specific tandem MS techniques. The different photophores have
distinct MS fingerprints, which allow fast recognition of the modified sites.
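
One simple component of such a fingerprint is the mass added to the peptide by
the cross-linked probe. The Python sketch below is a minimal illustration under
stated assumptions: it considers only simple insertion products without
secondary rearrangements, the function names and the example masses are
hypothetical, and only the monoisotopic mass of N2 is a physical constant. It
encodes the point made in Sect. 3.1 that aryl azides and diazirines eliminate
nitrogen on excitation, whereas benzophenone adds to the target without loss of
atoms.

```python
# Monoisotopic mass of N2 (two 14N atoms), in Da.
N2_MONOISOTOPIC = 2 * 14.003074

# Mass lost by the photophore on activation, before the covalent bond forms.
# Assumption: simple insertion products only, no secondary rearrangement.
PHOTOLYSIS_LOSS = {
    "aryl_azide": N2_MONOISOTOPIC,    # nitrene formed after loss of N2
    "diazirine": N2_MONOISOTOPIC,     # carbene formed after loss of N2
    "benzophenone": 0.0,              # triplet ketone adds without losing atoms
}


def mass_shift(probe_mass: float, photophore: str) -> float:
    """Mass added to the peptide by one cross-linked probe molecule (Da)."""
    return probe_mass - PHOTOLYSIS_LOSS[photophore]


def expected_adduct_mass(peptide_mass: float, probe_mass: float, photophore: str) -> float:
    """Expected monoisotopic mass of the peptide-probe adduct (Da)."""
    return peptide_mass + mass_shift(probe_mass, photophore)


# Hypothetical peptide and probe masses, chosen only for illustration.
for photophore in PHOTOLYSIS_LOSS:
    adduct = expected_adduct_mass(1000.0, 300.0, photophore)
    print(f"{photophore}: expected adduct at {adduct:.4f} Da")
```

A search for modified peptides in a digest could then look for unmodified
fragment masses plus this shift; real spectra also contain products of solvent
capture and rearrangement, which the sketch deliberately ignores.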

4 

Specific Examples of the Application of Photoaffinity Labeling 
in Signaling Pathways 

The specific examples are classified according to the major signaling pathways 
and following the track of the signal from receptor binding to ultimate intra- 
cellular destination. 
This account focuses primarily on multi-protein signaling cascades; other
pathways are not discussed in detail and are mentioned only to illustrate their
position in the colorful world of cell-to-cell communication.



4.1 

Photoaffinity Labeling of Ion-Channels 

Ion-channels have been extensively studied with photoaffinity labeling since the 
earliest years and were the subject of several excellent reviews [52, 53] and 
papers. For illustration, some papers on calcium channels [54, 55]
and sodium channels [56, 57] are cited here.

Receptor binding in neuronal synapses causes ion-release inducing very fast 
polarization or depolarization. The main signaling event in the ion-channel 
interaction is a conformational change which allows opening of the narrow pore 
within the protein transmembrane subunits for anions or cations. One example
is given here, which provides evidence that other associated biopolymers also 
participate in the channel-blocking interactions even if they are not directly 
involved in the channel-opening effect. 

Nakanishi studied the interaction of philanthotoxin (a polyamine amide) with the
nicotinic acetylcholine ion-channel [58]. Philanthotoxin-133 (PhTX-133) is a
non-competitive channel blocker found in the venom of the wasp Philanthus. The
nicotinic acetylcholine ion-channel is composed of five transmembrane subunits
(α, α′, β, γ, and δ), which form a 270-kDa glycoprotein. The major acetylcholine
binding sites are in the α and α′ subunits. A 43-kDa cytoplasmic protein is
associated non-covalently with the receptor, but its interaction with the receptor
is not essential for the channel opening (Fig. 6).

Interestingly, the aryl azide-modified PhTX-133 (1) photoprobe labeled all
the subunits upon irradiation when the 43-kDa protein was not present, while
the expected α-subunits were labeled only in the presence of that cytoplasmic
protein. That finding provided evidence that the additional protein is necessary
for orienting the toxin to the binding region. The binding site of the toxin
was identified in the cytoplasmic loop of the α-subunit. In the paper a complete
binding site architecture was postulated.



4.2 

Photoaffinity Labeling of G-Protein Coupled Receptors and Other Protein Targets 
Involved in GPCR Signaling Cascade 



4.2.1 

G-Protein Coupled Receptors 

The G-protein coupled receptor family comprises most well-known cell surface
receptors including the major drug targets, as previously stated. Early PAL
results have been reviewed in several papers and book chapters. For opiate,
NMDA, sigma, benzodiazepine, GABA, acetylcholine, adrenergic, and serotonin
receptors see [52, 59, 60], and for purinergic, histamine, and dopamine receptors
see [61].
Fig. 6. Photoaffinity labeling of the nicotinic acetylcholine ion-channel with a Philanthotoxin
probe 



Beta-adrenergic receptors are among the most widely studied systems in
pharmacological sciences. Catechol and catecholamines interact primarily
with transmembrane domain 5 (TM-5) at Ser-204 and Ser-207 as well as with
TM-3 at Asp-113 in β2-adrenergic receptors (β2-AR). Salmeterol is a long-acting
β2-AR agonist, which exhibits unusually slow dissociation from the receptor. In
addition to the regular catechol interaction it has been proposed that salmeterol
interacts with an additional site termed the “exosite”. The current model postulates
that salmeterol is tethered by its long side-chain to the exosite with high affinity
and that this site acts as an anchor, keeping the agonist in the catechol binding
site. To prove experimentally the existence of an exosite, an aryl azido derivative of
salmeterol (2, Fig. 7) was prepared and used in photoaffinity experiments [62].
Fig. 7. Photoaffinity probes for studying G-protein coupled receptors

In fact, the majority of the radiolabel was recovered in the small C-terminal
fragment containing TM-6 and TM-7. It delivered biochemical evidence that 
the aryloxyalkyl side-chain of salmeterol was positioned in that region and 
represented a secondary binding site, while the pharmacophore portion of the 
compound bound to the expected catechol site. 

Calcitonin, an inhibitor of bone resorption, is a 32-amino-acid peptide hormone
which acts through a G-protein coupled receptor on the target cell surface.
To our knowledge, calcitonin activates not only Gα-subunits, but also other
G-proteins linked to PLC and PKC enzyme activity as well as IP3/Ca²⁺-releasing
pathways. The receptor belongs to the same family as the receptors for parathyroid
hormone, secretin, vasoactive intestinal peptide, and glucagon-like peptide-1. To
elucidate the molecular signaling mechanism upon receptor activation, an array
of photoaffinity scanning analogues was prepared with altered structures. Thus,
first all Lys residues in salmon calcitonin were replaced by Arg, followed by
replacement of hydrophobic amino acids by photoactivatable
Lys(ε-p-benzoylbenzoyl) groups. Val-8, Leu-16, and Leu-19 were successively
replaced by the photoactive structural unit. The N-terminus was also modified with
the photophoric group. The photoreactive calcitonin analogue specifically labeled
a single 70-kDa protein. The size of the protein was higher than expected, which
might be explained by posttranslational glycosylation. Further studies will
reveal the exact modification site of the photo-crosslinking [63].

The cholecystokinin (CCK) receptor has been extensively studied by different
research groups, which provides an excellent opportunity for comparison. CCK plays
a role in the CNS, modulating dopamine release, and is thus involved in neurological
disorders, e.g., panic attack. Both subtypes of CCK bind to G-protein coupled
receptors, which causes stimulation of phospholipase Cγ with release of IP3,
ultimately leading to an increase in the intracellular Ca²⁺ level. CCK has additional
effects on the gastrointestinal tract. The gastrin and CCK families share a common
C-terminus (Trp-Met-Asp-Phe-NH2) that functions as the minimum bioactive
motif. The peptide can be found in neurons as an octapeptide with the same
motif as described. Baculovirus expressed receptor (CCKj, subtype) was inves- 
tigated with two photolabile, modified octapeptide portions of CCK, which 
contained photoactivatable p-benzoyl-benzoyl moieties and a p-aminobenzoyl 
residue instead of Met-28 and Gly-29. The main labeled band corresponded to a 
45-kDa protein, which was previously confirmed by immunoprecipitation. After 
deglycosylation a 41-kDa protein was retained [64]. 

Miller reported several monomeric, photolabile CCK agonists and antagonists. 
The photoreactive residue (L-Bpa) was placed at the N-terminus and a fluorescent
reporter group was also linked to it. The CCK receptor in the study
was expressed on Chinese hamster ovary-CCKR cells. To identify the labeled
domains on the receptor, capillary electrophoresis was used with laser-induced
fluorescence detection. Separate regions were labeled by the two photoprobes:
one labeled the first extracellular loop (96-121), while the other labeled a
fragment in the second extracellular loop (174-195).

A receptor model was deduced based on the labeling pattern [51]. 

Several non-peptidic CCK photoaffinity probes were synthesized by Darrow 
et al. [65]. Benzophenone and diazirine photophores were linked to
1,5-benzodiazepine derivatives (3a, b, Fig. 7), and the photoprobes exhibited either
agonist or antagonist activity in a rat amylase assay.

The secretin receptor is a member of the class II G-protein coupled receptors,
sharing only 12% homology with rhodopsin/β-adrenergic receptors. Secretin receptor
families have long amino-terminal domains containing six highly conserved Cys
residues, which participate in disulfide bonds. Mutagenesis studies suggested that
this domain had a critical role in binding. [Tyr¹⁰, Bpa²²]-secretin-27 was
a full agonist and stimulated cAMP accumulation. Upon photolysis the photolabile
ligand labeled a 57-62-kDa band. After cyanogen bromide treatment the
majority of the radioactivity was recovered in a 19-kDa band, which was
deglycosylated into a 9-kDa polypeptide. Multiple immunoprecipitation and further
proteolysis resulted in a binding site model, in which the site is located in the
N-terminal region [66].
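
Chemical fragmentation with cyanogen bromide is easy to predict from a known
sequence, because the reagent cleaves the peptide chain on the C-terminal side of
methionine residues. The Python sketch below is a minimal illustration: the
function name cnbr_fragments and the example sequence are hypothetical (the
sequence is not that of the secretin receptor), and incomplete cleavage and side
reactions are ignored.

```python
def cnbr_fragments(sequence: str):
    """Split a one-letter protein sequence after every methionine (M).

    Each internal fragment ends with the Met at which cleavage occurred
    (chemically it is converted to homoserine lactone, which is not modeled).
    """
    fragments = []
    start = 0
    for index, residue in enumerate(sequence):
        if residue == "M":
            fragments.append(sequence[start:index + 1])
            start = index + 1
    if start < len(sequence):
        # C-terminal remainder that does not end with Met.
        fragments.append(sequence[start:])
    return fragments


# Hypothetical sequence, chosen only to show the splitting rule.
example = "ACDMEFGHMKLWT"

for number, fragment in enumerate(cnbr_fragments(example), start=1):
    print(f"fragment {number}: {fragment} ({len(fragment)} residues)")
```

Comparing the predicted fragment sizes with the size of the labeled band, after
allowing for attached carbohydrate and the mass of the probe, is what narrows
the labeled region down to a candidate fragment.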

Thrombin is a multifunctional serine protease involved in regulating clotting
processes, wound healing, and inflammatory responses. During activation a
tethered small peptide ligand is released from the thrombin N-terminus by enzymatic
self-cleavage and acts on the G-protein-coupled thrombin receptor. After standard
signaling pathways Ca²⁺ is released, which converts fibrin-stabilizing factor to
fibrinoligase, which strengthens the fibrin-fibrin links in the presence of Ca²⁺.
The thrombin receptor activating peptide (TRAP) is a short sequence which
corresponds to the sequence cleaved from thrombin and activates the receptor
itself even if not linked to the enzyme. In order to study the thrombin receptor,
a photolabile Bpa-FLLRN-NH2 analogue was first synthesized. That antagonist
was an analogue of the minimum active sequence SFLLRN-NH2. In a radiolabeled
form Bpa-FLLRN-NH2 specifically labeled four proteins in blood platelets,
but the radioactivity incorporation was weak and prevented further studies
(G. Dorman, J.T. Elliott, and G.D. Prestwich, unpublished data). In order to
improve the radiolabeling efficiency Elliott prepared a series of new Bpa
photoprobes based on a recently developed receptor antagonist [67].

Alpha-melanocyte-stimulating hormone (α-MSH) is a 13-amino-acid
peptide which is a member of the anterior pituitary peptide hormones.
There are three subtypes, α, β, and γ. It has been proven that in fish and amphibia
it controls pigmentation. On the other hand, in humans the exact role is still
unknown although it might play a part in temperature control. Eberle prepared
a photoactivatable α-MSH analogue, which contained three aryl azides at positions
1, 9, and 13 (4, Fig. 7). The photophore at position 1 was connected by a cleavable
disulfide linkage. Photocrosslinking of the α-MSH receptor on melanophores
of Anolis carolinensis led to almost complete receptor inhibition, which
was converted into long-lasting receptor stimulation by exposure to a
disulfide-cleaving reagent disconnecting the crosslink between position 1 and the
receptor. This result allows the receptor to be arrested in an activated or
inactivated state and also allows switching between the states. Since the α-MSH
receptor belongs to the G-protein coupled receptor family, this major intracellular
communication pathway can be studied with the present methodology
by turning the signals on and off [68].

Angiotensin II is an octapeptide hormone that plays an important role in the
regulation of the cardiovascular system (blood pressure and sustained hypertension).
Angiotensin II receptors are coupled to the effector system via G-proteins,
which results in regulation of the Ca²⁺ current in sympathetic neurons. Primarily,
stimulation of angiotensin II receptors induces the PLCγ/IP3 signaling sequence,
and ultimately results in an activation of the Ca²⁺/calmodulin system. A hexapeptide
portion of angiotensin II (Phe-Pro-His-Ile-Tyr-Val) exhibits its activity
on a unique binding site, as recently discovered. Bernier studied the AT-4 receptor
by PAL. The N-terminal amino acid was replaced by Bpa or azidophenylalanine
(Apa), forming two different types of photoprobes. Both photoligands
achieved over 60% covalent labeling. The primarily labeled 186-kDa protein
was deglycosylated, giving a 129-kDa biopolymer. Mild trypsin treatment
released a large fragment, which constituted the majority of the protein, revealing
that the AT-4 receptor is a single transmembrane protein resembling the
family of growth factor or cytokine receptors (see later) rather than a G-protein
coupled receptor like the AT-2 or AT-3 receptor [69, 70].
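
The deglycosylation shifts quoted in this section can be compared on a common
scale by expressing the removed mass as a fraction of the apparent mass of the
labeled band. The Python sketch below is only an arithmetic illustration: the
function name carbohydrate_share is hypothetical, the two pairs of masses are
those quoted above for the CCK and AT-4 receptors, and apparent masses from gel
electrophoresis are approximate, so the percentages should be read as rough
estimates.

```python
def carbohydrate_share(glycosylated_kda: float, deglycosylated_kda: float) -> float:
    """Fraction of the apparent mass that is removed by deglycosylation."""
    if glycosylated_kda <= 0 or not 0 <= deglycosylated_kda <= glycosylated_kda:
        raise ValueError("expected 0 <= deglycosylated mass <= glycosylated mass")
    return (glycosylated_kda - deglycosylated_kda) / glycosylated_kda


# Apparent masses (kDa) before and after deglycosylation, as quoted in the text.
reported = {
    "CCK receptor": (45.0, 41.0),
    "AT-4 receptor": (186.0, 129.0),
}

for name, (before, after) in reported.items():
    share = carbohydrate_share(before, after)
    print(f"{name}: {before - after:.0f} kDa removed ({share:.0%} of apparent mass)")
```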

Substance P (SP) is a neuropeptide and member of the tachykinin family, released
in nociceptive sensory or enteric neurons and in many regions of the brain. SP
remained a pharmacological curiosity with implications in several disease states
(including pain modulation and inflammation), which derive from its wide range
of physiological responses in many cell types. SP receptor binding activates the
PLCγ/IP3/Ca²⁺ intracellular signaling pathways via a G-protein. The widely
distributed SP-related neurokinin (NK) receptors were extensively studied with
photoaffinity probes in order to identify specific binding sites, which could be a
starting point for efficient antagonist design. First a Bpa-modified undecapeptide
was synthesized by Boyd et al., in which Phe-8 in the C-terminal region was
substituted with Bpa (Bpa⁸-SP). Upon irradiation the photoprobe labeled two
polypeptides (46 kDa and 53 kDa) and further studies revealed that the smaller
protein was a proteolysis product [71]. The very efficient crosslinking (63%)
allowed further investigation using SDS-PAGE and MALDI MS analysis. This
led to the identification of the exact attachment site as a methionine residue
(Met-181) in the second extracellular loop (178-183) [72]. Maggio et al. studied
murine SP receptor with two photolabile analogues (SP-Bpa^ and SP-Bpa^). The 
initially identified 75-kDa band was shifted to 42 kDa followed by deglycosyla- 
tion, which was the weight expected from cDNA sequence. Interestingly SP-Bpa^ 
interacted with the N-terminal extracellular tail, whereas SP-Bpa® labeled a region 
in the second extracellular loop [73]. The same group developed for SP receptor 
labeling a new radioiodinatable benzophenone containing amino acid (p-hy- 
droxybenzoylphenyl alanine, p-OH-Bpa) (see Fig. 3) allowing one to unite the 
photophore and the radiolabel in the same structural unit [46]. 

Non-peptidic tachykinin antagonists were converted to photoprobe ligands
by Ward. First, a piperidine derivative, CP-99,994 (Glaxo), was appended with a
diazirine photophore (6, Fig. 7) to study SP (NK1) receptors [74]. A similar
modification of a neurokinin A antagonist, SR 48968 (Sanofi), produced a
photoligand (5, Fig. 7) for investigating NK2 receptor proteins [75].

N-formyl-peptide (FP) receptors are G-protein coupled receptors and members of
the phagocyte chemotactic receptor family, which are involved in inflammatory
processes. N-formyl-peptides are indicators of the presence of bacteria or of
damage to host cell mitochondria. Receptor binding gives a signal of infection,
activating chemotaxis and the defense system. The exact binding site of the
receptor was identified by using an agonist analogue, an N-formyl-peptide
photoprobe (Formyl-Met-Bpa-Phe-Tyr-Lys-ε-N-fluorescein). The photoaffinity
labeled recombinant human phagocyte FP receptor was cleaved with CNBr and the
fluorescent fragments were isolated on an antifluorescein immunomatrix. MALDI
MS identified a major fragment with a mass corresponding to the ligand
cross-linked to a CNBr fragment of FPR (residues 83-87). Further cleavage
resulted in the identification of the crosslink to Val-Arg-Lys (amino acids
83-85). These residues lie within the putative second transmembrane spanning
region of the receptor near the extracellular surface [76].
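
The CNBr step used above can be illustrated with a minimal sketch. It relies only
on the general fact that cyanogen bromide cleaves a polypeptide on the
C-terminal side of methionine; the sequence `toy_sequence` is a hypothetical
example and not the FP receptor sequence, and the conversion of the C-terminal
Met to homoserine lactone is ignored for simplicity.

```python
def cnbr_fragments(sequence):
    """Split a one-letter protein sequence after every Met (M).

    Returns a list of (first_residue, last_residue, fragment) tuples using
    1-based residue numbering, as in the text.
    """
    fragments = []
    start = 0  # 0-based index where the current fragment begins
    for i, residue in enumerate(sequence):
        if residue == "M":
            # Cleavage occurs C-terminal to Met, so Met ends the fragment.
            fragments.append((start + 1, i + 1, sequence[start:i + 1]))
            start = i + 1
    if start < len(sequence):
        # Remaining C-terminal piece that does not end in Met.
        fragments.append((start + 1, len(sequence), sequence[start:]))
    return fragments


# Hypothetical toy sequence, for illustration only.
toy_sequence = "AGMVRKAMLT"
for first, last, fragment in cnbr_fragments(toy_sequence):
    print(first, last, fragment)
```

In a labeling experiment, the fragment whose observed mass equals its calculated
mass plus that of the attached ligand is the candidate crosslinked fragment.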

Neuropeptide Y (NPY), a 36-amino acid peptide, is distributed widely in the CNS
and is known as a potent vasoconstrictor. NPY is a neuronal and endocrine
messenger that acts on several receptor subtypes. Sequence comparison indicates
that Y1, Y4, and Y6 share more similarity with each other, forming one
distinct family. The other two (Y2 and Y5) represent a different subtype. Y2 is
located primarily in the brain and particularly in the hippocampus. Two
photoaffinity analogues were prepared containing
3-trifluoromethyl-3H-diazirin-3-ylphenylalanine (TMDPhe). The photoreactive
amino acid was inserted at position 36 (TMDPhe36) or position 27 (TMDPhe27) and
the analogues were used in labeling recombinant receptors. A radiolabel was
introduced via the Lys ε-amino groups as tritiated propionates. After photolysis
the cross-linked peptides were detected by immunological methods. Two proteins
were detected for each cell line: a 58-kDa and a 50-kDa species. From the cDNA
a 42-kDa protein was expected, and the difference therefore indicated
posttranslational glycosylation. Experiments with endoglycosidase confirmed
this presumption. The two identified proteins represent different glycosylation
levels of the Y2-receptor [77].
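
The mass argument used here (and for the other deglycosylated receptors in this
section) is simple arithmetic, sketched below under the assumption that the
apparent SDS-PAGE masses can be subtracted directly, which is only approximate
for glycoproteins. The function name `glycan_contribution` is illustrative; the
numbers are the Y2-receptor values quoted in the text.

```python
def glycan_contribution(observed_kda, core_kda):
    """Return (carbohydrate mass in kDa, fraction of the apparent mass).

    observed_kda: apparent mass of the labeled, glycosylated band
    core_kda:     mass of the polypeptide expected from cDNA or found after
                  deglycosylation
    """
    if observed_kda <= 0 or core_kda <= 0:
        raise ValueError("masses must be positive")
    glycan_kda = observed_kda - core_kda
    return glycan_kda, glycan_kda / observed_kda


CORE_KDA = 42.0  # Y2 receptor mass expected from cDNA, as stated in the text
for band_kda in (58.0, 50.0):  # the two labeled bands reported for each cell line
    glycan_kda, share = glycan_contribution(band_kda, CORE_KDA)
    print(f"{band_kda:.0f}-kDa band: about {glycan_kda:.0f} kDa carbohydrate "
          f"({share:.0%} of the apparent mass)")
```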

Corticotropin-releasing factor (CRF) is a 41-amino acid polypeptide which
is thought to synchronize physiological responses to stress. CRF exerts its
activity through G-protein-coupled receptors mainly found in the pituitary and
brain. Besides the two known subtypes, a 37-kDa binding protein was also
identified, which is not homologous to the known receptors. The contact domains
of the receptors (CRFR-1 and CRFR-2) and other CRF binding proteins were studied
by Spiess et al. with photoaffinity probes. A diazirine photoreactive group was
connected to the N-terminus of the bioactive peptide together with a tyrosine
residue for radiolabeling (7). Upon irradiation the photoactivatable CRF labeled
a 75-kDa protein (CRFR-1) from rat brain which, after deglycosylation, resulted
in a 45-kDa polypeptide. That size was in agreement with the size estimated for
rat CRFR-1 from the cDNA [78].

Parathyroid hormone (PTH) regulates calcium levels in blood and bone
remodeling. The 'activation domain' of that 84-amino acid polypeptide is located
at the N-terminus (amino acids 1-34). The parathyroid hormone receptor is a
typical G-protein coupled receptor, which is coupled to both the adenylyl
cyclase/cAMP and the PLCγ/IP3/cytosolic Ca2+ intracellular signaling pathways.
In order to identify the structural elements involved in peptide hormone binding
and signal initiation, Chorev et al. employed a photoaffinity scanning approach.
The N-terminal amino acids were successively deleted or modified and the new
N-terminus was replaced with photoreactive Bpa. The most active peptide
analogue, [Bpa1, (norleucine)8,18, L-2-(naphthylalanine)23,
Tyr34]bPTH-(1-34)NH2, specifically labeled Met-425 within the putative receptor
contact domain (amino acids 409-437) located in transmembrane helix 6 [79].

Vasopressin receptor (VP) is a typical member of the G-protein coupled
receptors. Knowledge of the hormone binding site is of particular interest.
V2 activates an adenylate cyclase pathway, which mediates the antidiuretic
action. A radiolabeled photoreactive VP agonist, [3H][Mpa1, Lys(Apam)8]VP
(Apam = azidophenylamidino), was used in a PAL experiment and specifically
labeled the second extracellular domain. The cross-linked amino acids were
identified as Thr-102 and Arg-106. The sequence of the second extracellular
domain is in fact highly conserved among the VP receptor subtypes, which most
likely confirms the presence of the binding site in that region [80].



4.2.2 

Light-Induced and Chemosensory Signaling Pathways 

Rhodopsin, the receptor of light, consists of a chromophore (11-cis-retinal) that
is covalently bound to the apoprotein opsin (37-46 kDa) via a Schiff base to a
lysine residue. Light initiates the visual process by photoisomerization of the
chromophore from the 11-cis to the trans configuration. This transformation
alters the conformation of the surrounding opsin moiety. In that altered state,
called metarhodopsin, the conformational change is transmitted to the
cytoplasmic surface, enabling interaction of metarhodopsin with the G-protein
transducin and a kinase that phosphorylates metarhodopsin. Activated transducin
stimulates the exchange of GDP for GTP at the α-subunit. The GTP-bound α-subunit
in turn activates cGMP-specific phosphodiesterases, which reduce the level of
cGMP, causing the cGMP-activated Na+-channel to close. The other G-protein
induced pathway is the PLCγ/IP3/Ca2+ signaling cascade in different species. The
signal can be terminated by the light-activated GTPase activity; alternatively,
a soluble 48-kDa protein binds to the phosphorylated metarhodopsin and
interchanges phosphorylation. This light-induced signal transduction has been
extensively studied by PAL. Since only activated G-proteins are able to bind
GTP, photocrosslinking can selectively distinguish the light-dependent
transducin α-subunit from other G-protein guanosine phosphate-binding subunits
not directly involved in the transmission. In an early study a photoreactive,
enzymatically non-cleavable GTP analogue, azidoanilido-GTP (8, Fig. 8), was
used as an efficient photolabel for the G-protein [81]. Preillumination with
blue light, which rapidly initiates the conversion of rhodopsin to
metarhodopsin, was followed by irradiation with intense UV light. Two proteins
were identified, a 41-kDa and a 39-kDa protein. When the preillumination was
carried out with red light, which suppresses the photoisomerization of the
retinal, only a 39-kDa G-protein subunit was detected, clearly showing the
direct involvement of the 41-kDa polypeptide in the photosignaling process.
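
The differential-labeling logic of this experiment amounts to a set comparison:
bands labeled under activating (blue) preillumination but absent under
suppressing (red) preillumination are the light-dependent candidates. The short
sketch below uses the band masses quoted in the text; the function and variable
names are illustrative only.

```python
def light_dependent_bands(activating, suppressing):
    """Bands (kDa) labeled only when rhodopsin is converted to metarhodopsin."""
    return set(activating) - set(suppressing)


blue_preillumination = {41, 39}  # kDa bands labeled after blue light + UV
red_preillumination = {39}       # kDa bands labeled after red light + UV

print(sorted(light_dependent_bands(blue_preillumination, red_preillumination)))
```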

The GTP-bound α-subunit activates a cGMP-specific phosphodiesterase, as
described briefly earlier. A benzophenone derivative of cGMP was prepared and
found to inhibit several types of cyclic nucleotide phosphodiesterases at low
concentrations. In rod outer segment preparations, the [α-32P]-tagged probe
specifically labeled the light-stimulated cGMP phosphodiesterase, opening
the possibility of using this probe for identification and active site mapping
of cyclic nucleotide-binding proteins or enzymes [82].

Fig. 8. Photoaffinity probes for studying light-induced and chemosensory signaling pathways

A tritiated, photoreactive diazo ketone analogue of 11-cis-retinal (9, Fig. 8)
was used to study the location and orientation of the chromophore in rhodopsin.
The 11-cis double bond was locked by building it into a six-membered ring
in order to prevent light-induced isomerization. Experimentally, the retinal
analogue was first incubated with bovine opsin for 24 h and allowed to form
rhodopsin in a ratio of 1:0.65 to avoid random labeling by non-bound
retinal. The photoaffinity labeling was carried out for 10 min at 254 nm, and
then the Schiff-base linkage of retinal was chemically cleaved. The released
retinal-derived radioactive tag was found to modify Trp-265 and Leu-266 in the
α-helix. According to the current receptor model (Fig. 9), retinal is linked to
Lys-296 in the seventh (G) α-helix in the middle of the lipid bilayer. The
present photoaffinity results confirmed that the entire chromophore resides near
the center of the bilayer, with the polyene's long axis angled only slightly
relative to the membrane plane. This position leaves the chromophore readily
accessible to light [83, 84]. In a previous study, in which photoisomerizable
photoprobes were used, both the C and F transmembrane α-helices were labeled.
Comparison of the two labeled regions gives a clear picture of the motion of the
small-molecule photosensor in rhodopsin [85].

Fig. 9. Photocovalent attachment to rhodopsin

Nakanishi also studied bacteriorhodopsin, which is a 26-kDa hydrophobic
protein found in the outer membrane of Halobacterium halobium. It functions
as a light-driven proton pump. It is composed of a 248-amino acid protein, and
an all-trans retinal is anchored to Lys-216 through a Schiff base. Upon light
absorption the chromophore undergoes a trans -> cis isomerization about the
C-13,C-14 double bond and deprotonation/reprotonation of the Schiff base. The
latter event is directly involved in the proton pumping. The orientation of the
chromophore can indicate the direction in which the proton transfer takes place,
beginning at the Schiff base. Determining the orientation of the C-9 and C-13
methyl groups, which are opposite to the location of the proton studied, can
solve the problem. A C-10 tethered (13 Å long) photoaffinity analogue
(10, Fig. 8) was synthesized with a tritiated, terminal aryl azide. The major
crosslinking site was identified as Arg-175 and Asn-176 of α-helix F, which
revealed that the C-9 methyl group is oriented towards the extracellular space,
the β-ionone ring is tilted at 27°, and the chromophore's plane is nearly
perpendicular to the plane of the membrane [83].

Transducin is a typical α-subunit of the light-activated G-protein containing
four functional sites: rhodopsin binding, GTP binding, cGMP phosphodiesterase
activation sites, and βγ-subunit contact sites. The structural organization of
these functional sites in the tertiary structure was identified by photoaffinity
cross-linking. A photoactivatable heterobifunctional cross-linking reagent,
N-(3-iodo-4-azidophenylpropionamido)-S-(2-thiopyridyl)cysteine (11, Fig. 8),
was attached to cysteine units in transducin. Within the α-subunit only two
cysteines (in different domains: Cys-210 or Cys-347) were conjugated with the
crosslinking reagent, and only separately; a bis-conjugated product was not
detected. Either modification inhibited the rhodopsin-catalyzed activation of
that G-protein. After irradiation the Cys-347 azide derivative was able to
cross-link to the 12-kDa intramolecular domain in the α-subunit, which contains
the second cysteine (Cys-210). The modification prevented the exchange of GDP
for GTP. The result indicates that conformational changes within the
α-subunit could be readily transmitted between the two functional domains
[86]. In a more recent study, Resek prepared rhodopsin mutants that preserved
only a single reactive cysteine residue per rhodopsin molecule, at position 65,
140, 240, or 316 on the cytoplasmic face; all the other cysteines were replaced
with serine. A carbene-generating diazirine was connected via a cleavable
disulfide group (12, Fig. 8) to each of the rhodopsin mutants. Visible light
activation formed metarhodopsin II, which caused rhodopsin to bind to
transducin. After UV irradiation (355 nm) only the Cys-240-linked photophore
(in the third cytoplasmic loop) was able to crosslink covalently to the
α-subunit of transducin. All the other crosslinks were found to be
intramolecular interactions [87].

NAD+ is a substrate for pertussis toxin, allowing the transfer of ADP-ribose
derivatives to G-proteins. Ruoho used [125I]-iodoazidophenylpropionyl-NAD+
(13, Fig. 8) and 2-azido-NAD+ to probe G-proteins using that toxin-induced
transfer methodology.

The aryl azido derivatives of [32P]NAD+ were employed to site-specifically
incorporate photoactivatable ADP-riboses into transducin at Cys-347 by means of
pertussis toxin, and the conjugation was followed by irradiation.
Photocrosslinking revealed three major bands after irradiation, which
corresponded to α-γ (47 kDa), α-α (83 kDa), and α-α-α (105 kDa) subunit contact
domains [88].

Vertebrate olfactory receptors also belong to the family of G-protein coupled
receptors. Olfactory receptor families have the capacity to recognize several
hundreds of different odorants. Lilial (14, Fig. 8) is one of the general
odorant ligands; it activates a specific receptor subtype and induces an
intracellular signal, which leads to Ca2+ mobilization via release of the IP3
second messenger. The olfactory receptor was overexpressed in bacterial systems.
Since isolated recombinant receptors have no biochemical or cellular responses,
these proteins can be easily characterized by PAL. A photoreactive lilial
analogue (15, Fig. 8) was prepared and the photoprobe specifically labeled the
recombinant receptors (31 kDa) with high efficiency, confirming the presence of
the correct polypeptide [89].



4.2.3 

Studying Interactions at G-Protein Level 

If a receptor is coupled to multicellular processes, a drug which would act at the 
receptor/G-protein interface would deliver a remarkable specificity - that is why 
so much importance is attached to studying such interaction at the molecular 
level. 

Photoreactive GTP analogues have proven beneficial for studies of G-proteins:
such probes can not only identify the G-protein that is activated by the
agonist, but also provide information about the environment of the GTP
binding site.

Gαo-proteins are widely distributed in the CNS. Three subtypes, Gαo1, Gαo2, and
Gαo3, are involved in the muscarinic receptor Ca2+ channel signaling machinery.
Anis studied that system in rat brain with the [α-32P]GTP-azidoanilide (8)
described earlier.

Considerable evidence has been gathered showing that, in the synaptoneurosome
membrane, depolarization induces the activation of G-proteins. This finding was
confirmed by PAL, when synaptoneurosomes in depolarized and repolarized
states were incubated and then photolyzed separately with
[α-32P]GTP-azidoanilide. Two subtypes, Gαo1 and Gαo3, showed five- to sevenfold
enhanced photocovalent labeling in the depolarized state relative to the
polarized state. This indicated an enhanced exchange of GDP for GTP (or its
analogues) [90].

The membrane-bound α2-adrenergic receptor is coupled to its cytoplasmic
effector enzyme through G-proteins. Site-directed mutagenesis studies
confirmed that the third or second cytoplasmic loops of the transmembrane
receptor and the C-terminal tail are important in contacting G-proteins and
transmitting the signal. In order to determine the receptor contact site of the
G-protein responsible for receiving the signal, Taylor designed a photoaffinity
probe that contained a 14-amino acid portion from the receptor. This "Peptide Q"
(G-protein activator peptide) was derived from the third cytoplasmic loop
(362-373) and contained an additional C-terminal cysteine residue. Peptide Q
itself proved an efficient inhibitor of the signaling system when α2-adrenergic
agonists were employed. On the other hand, Peptide Q activated GTPase in
purified G-proteins, confirming its receptor-mimicking action. A photoreactive
diazoketone was conjugated to the C-terminal cysteine of Peptide Q and the
photoprobe (16, Fig. 10) was used as a photoaffinity label for the G-protein.
The diazoketone derivative of "Peptide Q" specifically labeled the Go-subunit
of the G-protein [91].



4.2.4 

The PLC/IP3/Ca2+ Signaling Cascade

The phospholipase C (PLC) isozymes cleave the phosphodiester bond in
phosphatidylinositol-4,5-bisphosphate (PIP2), releasing two second messenger
molecules, inositol 1,4,5-trisphosphate (IP3) and diacylglycerol (DAG), as shown
before. The β-isozymes are controlled by the Gα or Gβγ subunits of the
heterotrimeric G-proteins coupled to surface receptors. The γ-isozymes are
substrates for tyrosine kinases, such as growth factor receptors.

The recently discovered PLC-δ1 is activated by a new class of G-proteins
which are also linked to cell surface receptors. The PIP2 binding site of the
enzyme was studied with a photoaffinity analogue of IP3
([3H](benzoyldihydrocinnamyl)-aminopropyl-IP3, 17a, Fig. 10) in which the
benzophenone-containing unit efficiently imitated the apolar acyl chain of PIP2.
After successful photolabeling, limited digestion separated the major domains.
Careful radioactivity analysis revealed that the majority of the radiolabel was
recovered in the pleckstrin homology (PH) domain instead of the catalytic
region, as initially anticipated. The PH domain is a highly conserved sequence
(130-140 amino acids) found originally in the platelet protein pleckstrin and
in several signaling proteins. PH domains form a characteristic
pseudo-β-sandwich containing a carboxy-terminal α-helix. The labeling pattern
showed that PIP2 binds the PH domain, which orients the substrate towards the
catalytic site in the best conformation for efficient binding [92].

The PLC-γ-produced inositol 1,4,5-trisphosphate is a common second
messenger, which plays an important role in intracellular Ca2+ mobilization.

IP3 receptors are located in the membrane of the endoplasmic reticulum as a
homotetramer forming a Ca2+ channel. The 260-kDa monomer contains eight
transmembrane regions and the majority of the amino acids are cytoplasmic.
That outer portion contains the putative IP3 binding site at the N-terminal
tail, while additional binding sites have also been identified: two for ATP
binding and another two for cAMP-dependent protein kinase interaction [22].
Localization of the IP3 binding sites was made possible by the above-mentioned
photoactivatable analogue of IP3 (17a, Fig. 10). PAL helped to identify a short
sequence in the putative binding region, previously localized to the cytoplasmic
N-terminal 1-788 region by mutagenesis studies.

Ryanodine receptors have a high degree of similarity to IP3 receptors and they
are responsible for Ca2+ influx from the sarcoplasmic reticulum of skeletal and
cardiac muscles. The ryanodine receptor also forms a very large homotetramer,
with a monomer subunit of 565 kDa. A tethered photoaffinity analogue
([3H]-10-O-[3-(4-azidobenzamido)propionyl]ryanodine, 18, Fig. 10) efficiently
labeled the 565-kDa protein that was previously identified. The labeled 76-kDa
tryptic fragment was recognized by an antibody raised against the
COOH-terminus. This confirmed the exact location of the high-affinity binding
site for ryanodine [93]. Another group described the synthesis of a similar
probe in a radioiodinated form [94].

Fig. 10. Photoaffinity probes for studying G-protein interactions, PLC/IP3/Ca2+ signaling
pathways, and the "arachidonic acid cascade"



4.2.5 

PLA Pathways: Eicosanoids and their Targets 

In another signaling pathway, which also utilizes PIP2, membrane-bound
PLA cleaves the ester groups between the acyl moieties, and most frequently
arachidonic acid is released. That 20-carbon fatty acid with four double bonds
gives rise to about 20 major and almost 100 different metabolites with
strikingly different biological activities. The series of events is collectively
called the 'arachidonic acid cascade' and the bioactive products of the
metabolic pathways are local hormones called eicosanoids. The major subclasses
of eicosanoids are prostaglandins, leukotrienes, thromboxanes, and lipoxins.
These local hormones are released near the membrane interface, allowing these
molecules to migrate to neighboring cells and bind primarily to G-protein
coupled receptors. Recently it was shown that these compounds can also act on
intracellular receptors by translocating the 5-LO enzyme from the membrane
interface.

The biosynthesis of eicosanoids utilizes several enzymes until the ultimate
bioactive ligand is obtained. The literature is very rich in PAL studies on
these enzymes and the local hormone receptors; therefore only some major results
on the key biotransformations and receptors are presented here.

Prostaglandin Endoperoxide Synthase-1 (or cyclooxygenase, COX-1) converts
arachidonic acid to prostaglandin endoperoxides. The enzymatically unstable
intermediate is the precursor of prostaglandins, including the inflammatory
factor PGE2, and prostacyclin/thromboxane components, which regulate blood
platelet aggregation. In addition, the PGES-1 (COX-1) enzyme is the common
molecular target for the non-steroidal anti-inflammatory drugs (NSAIDs) like
aspirin, ibuprofen, etc. Structurally, COX-1 is an integral membrane protein;
however, it lacks transmembrane domains. The topology of the membrane-associated
region, which is an amphipathic helix, was postulated to contain the catalytic
site. That region was identified with an apolar diazirine probe
(3-trifluoromethyl-3-(m-iodophenyl)diazirine, TID, 19, Fig. 10). The apolar
membrane photoprobe caused broad labeling of the protein region embedded in the
membrane, and the labeling was eliminated by standard NSAIDs as anticipated [95].

The other major arachidonic acid (AA) converting enzyme is an integral
binding protein, 5-lipoxygenase, which is responsible for the initial
transformation in a cascade of events towards the biosynthesis of leukotrienes.
Leukotrienes are major mediators of numerous biological processes, including
chemotaxis, and are implicated in hypersensitivity disorders like asthma. It was
discovered in the early 1990s that another protein, called 5-lipoxygenase
activating protein (FLAP), is necessary for the cellular synthesis of
leukotrienes. A radiolabeled arachidonic acid analogue, [125I]-L739,059
(20, Fig. 10), was synthesized by Wang et al. [96] and used in PAL studies of
human recombinant FLAP. The AA analogue efficiently labeled the 17-kDa FLAP, and
the labeling was eliminated by preincubation with AA and a known leukotriene
biosynthesis inhibitor, MK-886. These results suggest that FLAP may activate
5-LO by specifically binding AA and transferring that substrate to the enzyme
catalytic site [97].

Prostaglandin receptors, including that for prostacyclin (PGI2), are all
G-protein coupled receptors, which activate adenylate cyclase, resulting in cAMP
production. PGI2 receptors were identified in mouse mastocytoma P-815 cells
(43 kDa) with a photolabile analogue,
[15-3H]-19-(3-azidophenyl)-20-norisocarbacyclin (21, Fig. 10). The photoprobe
also labeled a 45-kDa band in membranes of porcine platelets [98].

Leukotriene B4 (LTB4) has been implicated in inflammatory processes and
chemotaxis. The G-protein-coupled receptor in human myeloid cells was postulated
to possess two distinct binding sites. Photoaffinity studies using unmodified
tritiated LTB4 as the photoreactive species labeled two different proteins in
the presence (53 kDa) and in the absence (56 kDa) of a non-cleavable GTP
analogue. This means that stabilization of the G-protein in the GTP-bound state
resulted in an interconversion of the first high-affinity binding site to an
alternative low-affinity binding site [99].

Cysteinyl leukotrienes (LTC4, LTD4, LTE4) are derived from the
biotransformation of an unstable epoxide precursor (LTA4), which is produced
from arachidonic acid in a 5-LO catalyzed process in the presence of FLAP. LTD4
receptors also belong to the family of G-protein coupled receptors. First, an
azido-LTD4 derivative was used in PAL experiments [100], and then a
diazirine-linked MK-0476 antagonist analogue photoprobe (22, Fig. 10) was
applied in further studies. In both cases a 43-kDa protein was labeled; however,
the diazirine group was better suited for the G-protein coupled receptors, since
these proteins are sensitive to wavelengths lower than 360 nm [101].



4.2.6 

Studying Unusual G-Proteins 

Microtubules have a key role in mitosis and cell proliferation. They are dynamic
assemblies of heterodimers of α- and β-tubulin. In the cell-reproduction cascade
tubulin polymerizes fast and subsequently depolymerizes. Tubulin dimers are
unusual guanyl nucleotide binding (G) proteins, which bind GTP reversibly at a
site in β-tubulin. GTP is irreversibly hydrolyzed to GDP during polymerization.

Rajagopalan reported a bidentate GTP photoaffinity probe to investigate the GTP
binding site on tubulin, combining two mechanistically different photoreactive
groups, an aryl azide and a benzophenone (23, Fig. 11). Sequential photochemical
activation of the noncovalent complexes gave cross-linking patterns different
from those of either monodentate probe. If the GTP probe-tubulin complex was
briefly irradiated at 254 nm to activate the aryl azide, the covalent attachment
was detected in the β-subunit. On the other hand, prolonged photolysis at 360 nm
led to incorporation of the benzophenone into the α-subunit [102].

Fig. 11. Photoaffinity probes for studying α- and β-tubulin

The dynamic instability of the microtubules is blocked by several antimitotic
agents, and according to the mechanism of action two major classes can
be distinguished. The first type of antimitotic agent suppresses the
polymerization of microtubules, while others, like paclitaxel, inhibit
depolymerization and freeze the tubulin in the polymerized state. Several groups
prepared photoaffinity analogues of paclitaxel. In earlier studies forskolin
(24, Fig. 11), maytansine (25, Fig. 11), and colchicine (26, Fig. 11) affinity
probes labeled both α- and β-tubulins (55 kDa and 52 kDa) with a labeling
preference for the β-subunit [103-105]. Azidophenyl [106] vinca-alkaloid probes
(27a, b, Fig. 11) were also developed and it was found that they largely
retained their biological activity as tubulin polymerization inhibitors. A
similar benzophenone probe was also prepared (G. Dorman, I. Ujvary,
G.D. Prestwich, unpublished results) and exhibited good inhibitory activity.
Horwitz first reported the identification of an amino acid sequence from the
paclitaxel binding domain in β-tubulin. When the azidophenyl group was placed in
the isoserine side chain of paclitaxel (28b, Fig. 11), a 31-amino acid sequence
(residues 1-31) at the N-terminus was labeled [107].

A benzophenone analogue in the same position (28a, Fig. 11) efficiently labeled
both subunits; however, that probe found more relevant use in identifying
paclitaxel binding domains in P-glycoprotein [108]. Another photoprobe
containing the azidophenyl group in the 2-position of the baccatin core (28c,
Fig. 11) labeled the 217-231 amino acid portion of β-tubulin [109]. Very
recently Ojima et al. reported paclitaxel photoprobes in which the benzophenone
moiety was attached to the 7- or 10-position. The 7-substituted analogue
showed particularly good activity in inhibiting tubulin depolymerization [110].

Finally, a French group developed a novel taxoid photoprobe [111], which
allowed the first identification of an amino acid sequence in the α-subunit
(281-304) together with the expected labeling in the β-subunit (217-229).
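
The labeled tubulin segments reported by the different paclitaxel probes can be
compared with a few lines of arithmetic on inclusive residue ranges. The sketch
below uses the residue numbers quoted in this section; the helper names
`span_length` and `overlap` and the dictionary labels are illustrative only.

```python
def span_length(first, last):
    """Number of residues in an inclusive range first..last."""
    return last - first + 1


def overlap(a, b):
    """Inclusive overlap of two residue ranges, or None if they are disjoint."""
    first, last = max(a[0], b[0]), min(a[1], b[1])
    return (first, last) if first <= last else None


# Residue ranges as quoted in the text (inclusive numbering).
labeled = {
    "beta, isoserine side-chain azide [107]": (1, 31),
    "beta, 2-position azide [109]": (217, 231),
    "beta, taxoid probe [111]": (217, 229),
    "alpha, taxoid probe [111]": (281, 304),
}

for name, (first, last) in labeled.items():
    print(f"{name}: {first}-{last}, {span_length(first, last)} residues")

# The two probes that label the central part of beta-tubulin share this segment.
print(overlap(labeled["beta, 2-position azide [109]"],
              labeled["beta, taxoid probe [111]"]))
```

Residue numbers in the two subunits refer to different polypeptide chains, so
only ranges within the same subunit should be compared in this way.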



4.2.7 

Protein Kinases Involved in G-Protein Coupled Pathways 

The covalent modification of cellular proteins by phosphorylation of serine/ 
threonine and tyrosine residues provides an efficient molecular switch for alter- 
ing cellular responses. 

Phosphorylation plays an important role in several processes of the cell life- 
cycle including cell proliferation, division, and apoptosis. 

Reversible protein phosphorylation occurs by means of protein kinases. 
Eukaryotic protein kinases comprise a 250-amino acid, conserved catalytic 
domain. They are widely exploited in the regulation of physiological processes; 
according to estimates the mammalian genome contains over 2000 PK genes. 

Protein kinases share a general architecture: a catalytic domain, a binding
domain that orients the substrate towards the catalytic site, and a phosphate
donor binding site, from which the γ-phosphate is transferred to the acceptor
hydroxyl residues.

There are several subfamilies of PKs, which require either Ca2+, diacylglycerol
or phorbol esters, or extracellular agonists for activation. Four major groups
of PKs are distinguished. Basic amino acid-directed kinases (PKA, PKB, PKG)
phosphorylate serine/threonine around such amino acid residues. PKA is activated
by cAMP upon stimulation of G-protein-coupled receptors and is composed of
heterotetrameric complexes of two catalytic and two regulatory subunits. PKC
needs membrane-bound phospholipids for activation.

CaMK kinases are Ca2+/calmodulin-regulated kinases, while the CMGC PKs
comprise cyclin-dependent kinases, mitogen-activated protein (MAP) kinases, GSK3
and Clk kinases, which phosphorylate substrates in proline-rich domains. The
last group are the PTKs (protein tyrosine kinases), which include both receptor
and non-receptor kinases and phosphorylate tyrosine residues (this will be
discussed in Sect. 4.3).

Protein kinases, in cooperation with other proteins, form multiprotein
complexes which are susceptible to activation by external agonist stimuli.
Depending on their function in cell-cycle regulation, the conformational
changes are initiated by autophosphorylation and dimerization, transmitted
by the previously discussed second messengers: cAMP, cGMP, IP3, PIP3, AA
and DAG.

Intracellular signaling cascades form complex networks of interconnected
protein systems, and the regulation of cell function relies on phosphorylation/
dephosphorylation of specific proteins. Since large numbers of ligands, receptor
agonists, and messengers are involved in signaling processes that use protein
kinase-mediated pathways as transmitters, a wide variety of probes has been
investigated.

Vitamin E (tocopherol), which is a major lipid-soluble antioxidant, inhibits
the differentiation of smooth muscle, presumably by modulating the activity of
protein kinase C. PKC inhibition by tocopherol has also been implicated in human
platelets.

A different set of photoaffinity probes (29, Fig. 12) was designed and synthes- 
ized by Atkinson et al. The photosensitive 4-azido-2,3,5,6-tetrafluorobenzyloxy 
group was placed at the terminus of the phytyl side chain [112]. 

It has been found that the catalytic activity of PKC is enhanced by a lipid
component of the cell membrane, namely phosphatidylserine. This activity is
further stimulated by sn-1,2-diacylglycerol. Oleic acid also activates the enzyme
in the presence of 1,2-diacylglycerol, and thus it is presumed to mimic
phosphatidylserine. In order to identify the modulatory binding site for oleic
acid on PKC, a photoaffinity analogue was devised. A carbene-generating
photophore, diazirine, was placed at the apolar terminus of the unsaturated fatty
acid ligand (30, Fig. 12). The synthesis and the photochemical activation
properties were reported by Rühmann and Wentrup [113].

Phorbol esters are known as powerful tumor promoters and activators of
PKC. Under physiological conditions the enzyme also requires 1,2-diacylglycerol
for activation. Shibasaki et al. reported the preparation of a photolabile
phorbol derivative that contained the photophore at the C-13 position
(31, Fig. 12) and a fluorescent label. The acyl groups of phorbol esters are
considered to be critical for their binding activity and for mimicking DAG.
In fact, the specific photolabeling localized the enzyme (82 kDa) and showed
that the C-13 ester is in close proximity to the target protein in the bound
state [114]. Wender et al. synthesized another phorbol ester probe, placing the
photophore at the C-3 position. The diazo ester probe (32, Fig. 12) bound to PKC
with high affinity [115]. Other tumor promoters are also believed to act via
activation of the PKC pathway. A series of arylazide teleocidin photoprobes
(33, Fig. 12) was designed and synthesized by Irie et al. The biological
activity of the photoligands was in the same range as that of the unmodified
parent compound [116].

Fig. 12. Photoaffinity probes for studying protein kinases

Protein kinase CK2 (casein kinase 2) is found in the cytoplasm and the nucleus
of eukaryotic cells and is elevated in many types of cancer. Unlike most serine/
threonine kinases it is not activated by specific stimuli, but is essentially
constitutively active. It autophosphorylates, which causes inactivation, and
various protein phosphatases are able to restore its activity. It forms a
heterotetramer, with an α-subunit (35-44 kDa) and a β-subunit (24-29 kDa). The
α-subunit displays the catalytic activity, while the β-subunit contains a binding
site for the substrate and is itself phosphorylated by the α-subunit. This
phosphorylation is required for optimal CK2 activity of the α-subunit. A number
of protein substrates are phosphorylated by CK2, among them several nuclear
oncoproteins including p53. CK2 and p53 associate in a tight complex that
involves the β-subunit of the enzyme. The regulation of this enzyme in these
processes is still poorly understood. Interestingly, CK2 activity is increased by
polycationic compounds, and spermine was found to be the most active. A
photoaffinity analogue of spermine (34, Fig. 12) efficiently labeled the
β-subunit as the primary binding site, but minor labeling was also detected in
the α-subunit (catalytic site). The close proximity of the subunits was thereby
demonstrated, and it was also confirmed that the binding site in the β-subunit
functions to orient the substrate towards the catalytic site [117]. The binding
site on the β-subunit was mapped and the location of the covalent attachment was
identified as Thr-72, while in the α-subunit the probe labeled Leu-220.



4.2.8 

Transcription Factors 

As mentioned earlier, DAG activates protein kinase C, which phosphorylates
transcription factors such as the nuclear transcription factor NFκB. NFκB forms
a multisubunit complex with an inhibitory subunit, which is phosphorylated by
PKC. The complex then dissociates, and the released NFκB translocates to the
nucleus and initiates gene transcription. NFκB is a heterodimer with two distinct
DNA-binding subunits of 50 kDa and 65 kDa, both members of the Rel
transcription factor family. These proteins have an important role in the
signaling cascade of the cellular defense system, and activate numerous genes in
response to pathogens or inflammatory cytokines.

Rel proteins contain a 280-amino acid homology region, which mediates both 
DNA binding and dimerization. DNA contact regions are located in the amino- 
terminal half and the residues responsible for the dimerization reside in the 
carboxy terminal half. 

A photoreactive nucleotide, 8-bromo-2'-deoxyadenosine, was incorporated
post-synthetically into a DNA sequence (35, Fig. 13) within the previously
determined DNA contact site for the transcription factor. Upon irradiation the
resulting nitrene cross-linked primarily to the 50-kDa subunit and covalently
modified Lys-244. On the other hand, that amino acid residue appeared not
to contact a DNA base, but rather a phosphate moiety that lies in close
proximity [118].

Ac-Asp-Ala-Asp-Glu-Tyr992(PO3H2)-Bpa-Ile-Pro-Gln-Gln-Gly 36a
Ac-Asp-Ala-Asp-Bpa-Tyr992(PO3H2)-Bpa-Leu-Pro-Gln-Gln-Gly 36b
Ac-Phe-Leu-Pro-Val-Pro-Glu-Tyr1068(PO3H2)-Bpa-Asn-Gln-Ser-Val 36c

Bpa-Asn-Gly-Asp-Tyr(PO3H2)-Met-Pro-Met-Ser-Pro-Lys-Ser
Gly-Bpa-Gly-Asp-Tyr(PO3H2)-Met-Pro-Met-Ser-Pro-Lys-Ser
Gly-Asn-Bpa-Asp-Tyr(PO3H2)-Met-Pro-Met-Ser-Pro-Lys-Ser
Gly-Asn-Gly-Bpa-Tyr(PO3H2)-Met-Pro-Met-Ser-Pro-Lys-Ser
Gly-Asn-Gly-Asp-Tyr(PO3H2)-Bpa-Pro-Met-Ser-Pro-Lys-Ser
Gly-Asn-Gly-Asp-Tyr(PO3H2)-Met-Bpa-Met-Ser-Pro-Lys-Ser
Gly-Asn-Gly-Asp-Tyr(PO3H2)-Met-Pro-Bpa-Ser-Pro-Lys-Ser
Gly-Asn-Gly-Asp-Tyr(PO3H2)-Met-Pro-Met-Bpa-Pro-Lys-Ser
Gly-Asn-Gly-Asp-Tyr(PO3H2)-Met-Pro-Met-Ser-Bpa-Lys-Ser
Gly-Asn-Gly-Asp-Tyr(PO3H2)-Met-Pro-Met-Ser-Pro-Bpa-Ser
Gly-Asn-Gly-Asp-Tyr(PO3H2)-Met-Pro-Met-Ser-Pro-Lys-Bpa
37a-37k

Fig. 13. Photoaffinity probes for studying transcription factors,
inositol-polyphosphate binding proteins, and biopolymers containing an SH2 domain



4.2.9 

Inositol-Polyphosphate Binding Proteins 

Profilin is a 14-kDa protein which binds actin monomers, proline-rich
sequences, and PIP2. The exact consequence of these binding events at the
cellular level is still largely unresolved. It has been suggested that profilin
has a regulatory role in the signaling pathways: it links transmembrane signal
transduction to the reorganization of the actin cytoskeleton required for cell
motility, and it also participates in the PIP2 cleavage pathway. Profilin binds
to the inner side of the cell membrane in contact with PIP2 residues. Receptor
activation, which causes PIP2 to be cleaved and leave the membrane, mediates the
translocation of profilin to the cytosolic fraction. On the other hand, profilin
is able to inhibit PLC-γ1. Benzophenone-based photoaffinity analogues of PIP2
(35a,b, Fig. 13) were used to localize the binding site on profilin. Upon
irradiation, quite interestingly, Ala-4 was labeled in the N-terminal helix.
Based on that result the bis-phosphate binding domain was modeled at Arg-135 and
Arg-136 close to the carboxy-terminal helix. In fact PIP2 seemed to stretch out
between the terminal regions, remaining in a fairly open position at the
periphery of the protein [119].



4.3 

Photoaffinity Labeling of Receptor Protein Tyrosine Kinases (PTK) 
and Biomolecular Targets Involved in the Signaling Pathways 

PTKs are monomeric in the inactive form, and consist of an extracellular binding
domain, a single membrane-associated α-helix and a cytosolic domain with the
kinase activity. Ligand binding induces dimerization and autophosphorylation,
which enables binding to adaptor molecules. These molecules connect the receptor
to the signal-transmitting biopolymers.

Growth factor activation of PTKs induces binding of the receptor to Grb2
(growth factor receptor binding protein 2), which contains one SH2 (Src
homology 2) and two SH3 domains. Grb2 in turn localizes the SOS protein to the
plasma membrane. SOS activates Ras by promoting the exchange of GDP for GTP.
Ras-GTP binds directly to serine/threonine kinases and activates the MAP kinase
signaling machinery, leading to increased gene expression. Normal Ras proteins
play an important role in growth factor initiated tyrosine kinase receptor
signaling and are considered central switches used by cells in cell
proliferation, cell differentiation and other genetic programming. The ras gene
and the encoded Ras G-protein are closely associated with carcinogenesis.
Mutations of the ras gene contribute to 20-30% of all human cancers. In cells
in which the ras gene is mutated, the G-proteins can be perpetually “turned on,”
continuously giving the signal for cell division even in the absence of growth
factors.



4.3.1

Studying Biopolymers Containing an SH2 Domain 

Src homology 2 domains (SH2 domains) are discrete phosphotyrosine
binding modules (composed of about 100 amino acids) found in a wide variety
of cytoplasmic signaling proteins, where they mediate physical association
between proteins. Two classes of SH2 domains exist, with a major difference in
the amino acids neighboring the phosphotyrosine-binding residue. SH2 domains
receive signals from the autophosphorylated tyrosine kinase receptors activated
by agonist binding, and transmit these signals, leading to activation of key
enzymes in the signaling cascade.

Another representative of PTK signaling is the insulin receptor, a
membrane-spanning glycoprotein with tyrosine kinase activity that is present on
the surface of insulin-responsive tissues. The receptor autophosphorylates and
also phosphorylates insulin receptor substrate-1 (IRS-1), which serves as a
docking site for proteins with SH2 domains. In order to localize the interaction
between insulin and its receptor, Shoelson envisaged an insulin agonist
photoprobe that would cross-link to the receptor with high efficiency and
activate it permanently. Based on previous studies, which suggested that Phe-25
(in the B-chain) is crucial in receptor recognition and that para-substituents
of this amino acid are well tolerated, he designed a photoprobe with a modified
C-terminal octapeptide of the B-chain. This new sequence contained a radiolabel,
a biotin label and a photoreactive amino acid, Bpa. This photoaffinity ligand
labeled a single 135-kDa peptide with an average cross-linking efficiency of
80%. The covalent receptor-ligand complex resulted in tyrosine kinase activation
in vitro, and in intact cells both insulin receptor phosphorylation and
internalization were activated [120].

Miller studied the interface between the autophosphorylation sites (Tyr-992,
1068, 1086, 1148, 1173) of the tyrosine-kinase type EGFR (epidermal growth
factor receptor) and the N-terminal SH2 domain of phospholipase Cγ1. PLCγ1 has
two SH2 domains, one at the N-terminal and one at the C-terminal. Upon
interaction of the SH2 domains with the phosphopeptide sites, the tyrosine
kinase phosphorylates the enzyme, which finally turns on the enzyme activity and
results in the release of two second messengers, IP3 and diacylglycerol. The
initial peptide-peptide interaction was examined by PAL, using the
benzophenone-based photoactivatable amino acid, Bpa. The photoreactive amino
acid was incorporated into different phosphotyrosine-containing 12-mer peptides
(36a-c, Fig. 13) derived from the major autophosphorylation sites of EGFR
(Tyr-992 and Tyr-1068). The photoaffinity experiment resulted in efficient
cross-linking of the small peptides to the SH2 domain; the Tyr-992-derived
peptide was covalently attached to Arg-562 and the Tyr-1068-derived peptide
cross-linked to Leu-653. Based on these results a three-dimensional model was
constructed predicting the entire contact site between the SH2 domain and the
model phosphopeptides [121].

A similar study was conducted by Shoelson. Phosphatidylinositol 3-kinase is
activated by contact with autophosphorylated tyrosine kinase receptors following
agonist binding to the cell-surface receptor. The truncated regulatory
subunit of the enzyme (p85), which contains an SH2 domain, retained its full
binding activity. Williams and Shoelson studied the phosphotyrosine-binding
region of the enzyme by photoaffinity scanning [122]. A series of IRS-1-related
(insulin receptor substrate-1) 12-mer phosphopeptides was prepared in such a
way that each amino acid in the sequence was successively replaced by the
photoactivatable Bpa (37a-k, Fig. 13). When the replaced amino acid was
adjacent to the phosphotyrosine residue, efficient cross-linking was observed at
Gln-83 of the SH2 domain within α-helix-I, while Bpa+4 substitution led to
specific cross-linking at Asn-85 within the flexible loop C-terminal to α-helix-II.
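The scanning design described above can be illustrated with a short sketch. It assumes the parent 12-mer is the sequence shown for the 37 series in Fig. 13 and that the phosphotyrosine is left untouched; the function and variable names are hypothetical and serve only to enumerate the analogues, not to reproduce the authors' procedure.

```python
# Parent IRS-1-related 12-mer as transcribed from Fig. 13 (three-letter codes).
PARENT = ["Gly", "Asn", "Gly", "Asp", "Tyr(PO3H2)", "Met",
          "Pro", "Met", "Ser", "Pro", "Lys", "Ser"]


def bpa_scan(parent, keep="Tyr(PO3H2)", probe="Bpa"):
    """Return one analogue per position, replacing each residue in turn
    with the photophore and skipping the residue named in `keep`."""
    analogues = []
    for i, residue in enumerate(parent):
        if residue == keep:
            continue  # the phosphotyrosine itself is never replaced
        variant = parent[:i] + [probe] + parent[i + 1:]
        analogues.append("-".join(variant))
    return analogues


scan = bpa_scan(PARENT)
for peptide in scan:
    print(peptide)
```

Eleven positions are available for substitution in a 12-mer with one fixed phosphotyrosine, which matches the eleven sequences drawn in Fig. 13.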

Phosphatidylinositol 3-kinase is a heterodimer of a regulatory subunit
(85 kDa), which includes the SH2 sequence, and a catalytic subunit (110 kDa).
Its function in the signaling processes has been widely investigated and, in
particular, the resulting second messenger PIP3 has been the subject of several
studies. The major putative targets were reported to be the serine/threonine
kinase Akt/PKB, protein kinase C, etc. In order to promote the identification of
those targets a photoactivatable analogue was prepared by Prestwich et al. and
is currently under investigation with several targets [123]. In rat cerebellar
membranes, using an inositol 1,3,4,5-tetrakisphosphate photoprobe,
[³H](benzoyldihydrocinnamyl)aminopropyl-inositol 1,3,4,5-tetrakisphosphate
([³H]BZDC-IP4, 17b, Fig. 10), a 42-kDa protein was identified and named
centaurin. In the region where the photoprobe cross-linked, a high-affinity PIP3
binding site was identified by photoaffinity competition assays [124].



4.3.2 

Studying Protein Prenylation

In normal cells, the GDP/GTP-binding proteins move to the cell membrane after
protein synthesis, where they become “hooked” by a hydrophobic farnesyl
group. The γ-subunit is anchored in the membrane by a post-translational
modification of the C-terminal CAAX sequence (C - cysteine, AA - aliphatic amino
acids, X - methionine). The protein is first enzymatically farnesylated by a
specific farnesyltransferase, then the AAX part is cleaved by specific proteases,
and finally the cysteine residue is converted to a methyl ester.
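The three processing steps just described can be pictured as a small string transformation. The following is only an illustrative sketch: the aliphatic residue set (A, V, L, I), the example sequence, the function names and the text labels for the modifications are assumptions made for illustration; X defaults to methionine as stated in the text.

```python
import re

# Assumption: "aliphatic" is taken here as Ala, Val, Leu, Ile (one-letter codes).
ALIPHATIC = "AVLI"


def caax_pattern(x_residues="M"):
    """Regex for a C-terminal CAAX box; X defaults to Met as in the text."""
    return re.compile(rf"C[{ALIPHATIC}]{{2}}[{x_residues}]$")


def process_caax(seq, x_residues="M"):
    """Return the products of the three steps, or None if no CAAX box is found."""
    if not caax_pattern(x_residues).search(seq):
        return None
    core = seq[:-4]  # everything upstream of the CAAX cysteine
    return [
        core + "C(farnesyl)" + seq[-3:],  # 1. farnesylation of the cysteine
        core + "C(farnesyl)",             # 2. proteolytic removal of AAX
        core + "C(farnesyl)-OMe",         # 3. methyl ester formation
    ]


# Hypothetical C-terminal fragment used purely for illustration.
for step in process_caax("GSKTKCVIM"):
    print(step)
```

Passing a wider set, for example `x_residues="MS"`, lets the same sketch accept other terminal residues if the reader wishes to relax the definition given in the text.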

Inhibition of the lipid modification cascade provides an alternative way to
block aberrant signaling pathways, and that opportunity can be exploited in
anticancer therapy. In growth factor signaling, the false activation signal
is transmitted by the proteins encoded by the mutated ras gene (Ras) and
ultimately leads to uncontrolled cell growth. These typical GTP-binding proteins
are also subject to membrane anchoring, and the biosynthesis of those Ras
proteins can be blocked at the posttranslational prenylation step.

The molecular elements of that pathway were mapped with photoaffinity
labeling by different investigators. Farnesyltransferase is a heterodimer of
α and β subunits, and binds both the protein and farnesyl diphosphate. The main
recognition element for the protein is the C-terminal CAAX motif. Coleman et
al. attached two benzophenones to the recognition sequence and the resulting
photoprobe (38, Fig. 14) specifically labeled both subunits [125].

In recent years several FPTase inhibitors were developed as peptidomimetics
of the key tetrapeptide fragment for use as potential anticancer agents. Miller
at Stony Brook synthesized a photoreactive benzophenone-containing inhibitor
(39, Fig. 14) which efficiently labeled the active site of the enzyme. The
photoinhibition was prevented by adding native Ras to the reaction mixture. That
competition indicated that the labeling was specific for the active site.
Peptide mapping of the labeled enzyme by HPLC, Edman sequencing and MALDI-MS
allowed the identification of key amino acids in substrate binding, namely
Asp-110 and Asp-112 in the α-subunit [126].

Fig. 14. Photoaffinity probes for studying biomolecular targets in protein prenylation

Other research groups attempted to identify the farnesyl diphosphate binding
site of the enzyme. Allen reported a trifluoromethyl-substituted diazoester
probe (40, Fig. 14), in which the photoreactive group was introduced at the
apolar end of farnesyl diphosphate. Upon irradiation the [³²P]-labeled probe
specifically cross-linked to the β-subunit (45 kDa) of both farnesyl
transferase and geranylgeranyl transferase [127]. DiStefano et al. synthesized
and evaluated other photoreactive analogues of farnesyl or geranylgeranyl
pyrophosphate containing an apolar benzophenone (41, Fig. 14), which replaced
one geranyl unit. The photoprobes, which partially mimicked the geranylgeranyl
isoprenoid unit, preferentially modified the β-subunit of the transferase
enzyme [128].



4.3.3 

Discovery of New Protein Targets Involved in Tyrosine Kinase Signaling

Benzoquinoid ansamycin antibiotics are extensively studied as antitumor agents
and as inhibitors of tyrosine kinases. Much evidence was collected showing that
the enzyme activity was rapidly depleted after treatment with ansamycin
antibiotics. Since the exact molecular mechanism of the tyrosine kinase
inhibition was unclear, an aryl azide photoaffinity analogue (42, Fig. 15) was
prepared. The erbB2-encoded p185 tyrosine kinase was effectively depleted by the
ansamycin analogue herbimycin. The photoaffinity labeling result was unexpected:
the specific radioactivity incorporation was detected in a 100-kDa protein
rather than in p185. The same protein was labeled in two fibroblast lines
transfected with the oncogenes erbB2 and v-src. The detailed study indicated
that binding to p100 presumably activates the degradation of p185 indirectly,
and it was postulated that the newly discovered protein is an important
regulator of tyrosine kinases [129].

The mitogen-activated protein kinases (MAPKs) are proline-directed Ser/Thr
kinases that activate their substrates by dual-phosphorylation. These kinases are
activated in signaling pathways stimulated by growth factors or cytokines, as
well as in those induced by stress reactions or UV light. After nuclear
translocation, activated MAPKs induce phosphorylation of nuclear transcription
factors. MAPKs are involved in many cellular processes such as cell
proliferation, oncogenesis, and cell cycling. They are also implicated in a
series of protein-protein interactions, which altogether regulate distinct
cellular outputs. One of the diverse cascades was discovered using PAL
techniques by researchers at SKB. Proinflammatory cytokines like interleukin-1
or tumor necrosis factors were successfully inhibited with a new series of
pyridinyl-imidazoles. Photoaffinity experiments revealed that the target
(cytokine suppressive drug binding protein - CSBP) for the inhibitors was
actually a protein kinase. Several initial studies pointed in the same
direction: the inhibition was at the translational level and particularly at the
initiation stage. Finally, by means of PAL the involvement of a protein kinase
was unambiguously identified. The azidophenyl-appended pyridinyl-imidazole
photoprobe (43, Fig. 15) labeled an approximately 45-kDa protein. After limited
digestion with trypsin and cyanogen bromide both the labeled and unlabeled
fragments were subjected to a sequence homology search. A 15-amino acid tryptic
fragment showed 85% homology to a C-terminal portion of a protein belonging
to a MAPK family. Another sequence was derived from an 8-kDa CNBr cleavage
fragment that contained the majority of the radiolabel. From sequence reading a
unique MAP kinase was identified. Based on the sequence a reverse-translated
oligonucleotide was synthesized and used in screening a human monocyte cDNA
library. Surprisingly, two distinct cDNAs were isolated. The two cDNAs as well
as the resulting recombinant CSBPs shared only 43% identity, while their sizes
were almost identical (42 kDa), close to the detected size of the natural
enzyme. The anti-inflammatory agents inhibited both kinases, confirming that the
correct polypeptides had been cloned. It is still unclear why the kinase exists
in two nearly identical forms in monocyte cells exhibiting the same cellular
function [130].

Fig. 15. Miscellaneous photoaffinity probes for studying interactions in tyrosine kinase,
calmodulin and nitric oxide related signaling



4.4 

Photoaffinity Labeling of Protein Targets in Additional Signaling Pathways 



4.4.1 

Calmodulin Signaling Pathways 

This pathway is discussed separately since general Ca²⁺-ion signaling
participates in several cascades.

Calmodulin (CaM), the principal intracellular calcium receptor, is implicated
in several signaling pathways. Since CaM pathways can be directly linked to
nitric oxide generation and immunomodulation, it is reasonable to discuss them
first. CaM is an acidic protein composed of 148 amino acids and two hand-like
subunits connected by a tether. This structural arrangement provides sufficient
flexibility for binding to several different targets. Upon Ca²⁺-ion binding CaM
interacts with the target proteins and induces a conformational change in the
targets, which results in their activation. The structural features of CaM
activation have been studied for more than a decade with photoactivatable
peptides containing Bpa. First, DeGrado investigated the Ca²⁺-dependent
conformational changes and the binding of CaM to model peptide sequences derived
from CaM-activated proteins [131].

Phenothiazines bind to calmodulin and stimulate the interaction with target
proteins. In order to map the conformational changes (subunit shifts) in CaM
upon phenothiazine binding, Watt devised a bidentate photoaffinity ligand (44,
Fig. 15), which contained a “fast reacting” photophore (aryl azide) and a “slowly
reacting” photophore (benzophenone). Two different wavelengths were used: upon
irradiation at 260-280 nm the phenothiazine residue was covalently linked to
its binding site, which resulted in a permanent conformational change. The second
photolytic activation at 360 nm allowed mapping of the activated conformation.
The photoaffinity reagent also contained a cleavable tartrate linkage, which
allowed the removal of the primary ligand phenothiazine residue from the
covalent complex [38].



4.4.2

Guanylate Cyclase Receptors and Biomolecular Targets Involved in that Signaling 

The guanylate cyclase signaling pathway also follows an enzyme-linked receptor
model. The effector enzyme, guanylate cyclase, converts GTP to cGMP, which in
turn activates cGMP-dependent protein kinase or phosphodiesterases.

Atrial natriuretic peptide (ANP) is a 28-amino acid endocrine hormone. It
has several effects: it increases Na⁺ and water excretion by the kidney and
vascular permeability, relaxes vascular smooth muscle, and inhibits several
other hormones. The major actions of ANP are mediated through membrane-bound
receptors, stimulating the guanylate cyclase catalytic activity that is an
integral part of the receptor. In order to map the ANP binding site in the
receptor a photoaffinity analogue was prepared, in which Arg at the C-terminal
was replaced with the photoreactive Bpa. A Gln residue was changed to
iodinatable Tyr. This residue together with Bpa and the Bpa-adjacent Phe forms a
hydrophobic core deep inside the peptide binding domain. The photoprobe
selectively labeled a 130-kDa glycoprotein with high efficiency [132].

A special guanylate cyclase receptor can be found in the nitric oxide (NO)
signaling pathway. The activation of the sequence of events in that pathway
results in smooth muscle relaxation. This pathway is directly linked to other
cascades by receiving a Ca²⁺ signal and utilizing calmodulin (CaM) as transmitter
protein.

Nitric oxide (NO), also termed endothelium-derived relaxing factor (EDRF),
is a gaseous second messenger, which is released in blood vessels, neurones,
macrophages, etc., in response to numerous extracellular stimuli and displays
important biological functions. It has an extremely short half-life of 10 s. NO
is released in endothelial cells, and primarily causes relaxation in neighboring
smooth muscle cells. NO binds to the heme group of the soluble guanylate cyclase
receptor within the cell and causes an immediate increase in the level of cGMP.
The primary step in the NO signaling pathway is the conversion of L-arginine to
NO by NO-synthase (NOS). That process requires Ca²⁺-bound calmodulin for
activating NOS and a cofactor, tetrahydro-L-biopterin (H4Bip). An intimate
H4Bip-heme-arginine interaction is a critical component of the catalytic
activity, and it was presumed that H4Bip facilitates an electron transfer during
the reduction of the oxyferroheme complex. BP-linked photolabile antagonists (45,
Fig. 15) were designed to identify the pterin contact region. PAL experiments
allowed identification of a 341-amino acid sequence in the oxygenase/dimerization
domain of NOS. This finding suggests that H4Bip may act simply by
stabilizing the dimer during L-arginine turnover without any implication in the
electron transfer process [133].

4.4.3 

Signaling Pathways Involved in Immunomodulation 

T-cell antigen receptors belong to a separate family, which is composed of eight
noncovalently bound transmembrane subunits. The central ligand-binding
element is a disulfide-linked α,β-heterodimer anchored to the membrane by a short
transmembrane sequence. The dimeric, variable subunits are the cellular
equivalents of the antigen-specific immunoglobulin G antigen binding fragment
(IgG Fab). The remaining “complementary determining” (CD2 and 3) chains
consist of γ- and δ-type glycoproteins and four other nonglycosylated subunits
(including ε, ε' and ζ). When a macrophage interaction delivers an antigen
fragment to the receptor with the assistance of the major histocompatibility
complex (MHC), the fragment is recognized and bound to these major subunits.
Upon binding, the eight individual subunits closely associate, resulting in
tyrosine phosphorylation of the cytoplasmic portions by a coupled kinase. The
phosphorylated tyrosines in the ζ-subunit are the recognition element in the
signaling pathway for the SH2 domain of a protein kinase, which stimulates PLC γ
activity. The known hydrolysis pathway produces IP3 and then Ca²⁺, as already
described. Luescher et al. extensively studied the T-cell receptor-antigen
ligand interactions with PAL [134]. Upon ligand binding, the MHC-protein-antigen
ligand complex slowly dissociates and leaves the T-cell receptor. In order to
control the ligand-receptor interaction, the ligand was first covalently linked
to the MHC molecules, then cross-linked to the receptor. These two events were
directed by a bidentate photoaffinity ligand using two aryl azide photophores
(46, Fig. 16), allowing sequential activation at different wavelengths. First,
the N-terminal of the antigen fragment, derived from Plasmodium berghei
circumsporozoite (YIPSAEKI), was linked with iodo-hydroxy-aryl azides and upon
irradiation at > 350 nm covalently attached to the Kd-restrictive peptide
(Kd-Q10). That irradiation left unreacted the other photophore (unsubstituted
aryl azide) attached to the Lys-259 residue in the short peptide. It was excited
at < 350 nm and cross-links to the T-cell receptor were created efficiently. The
labeling was inhibited by Kd-Q10 specific antibodies, which confirmed the
binding at the anticipated site. Very recently the same group reported that the
above photoprobe labeled T-cell receptor residues Tyr-48 and/or Tyr-50 in the
CD2 region [135].

As explained, the immunomodulation signal starts with binding of the antigen to
the T-cell receptor, which ultimately results in Ca²⁺ release. Ca²⁺ induces
calmodulin activation, which in turn activates calcineurin, a Ca²⁺-dependent
phosphatase. This event leads to translocation of the cytoplasmic component of
the transcription factor, which is required for IL-2 gene expression and T-cell
activation.

Cyclosporin A (CsA) and FK506 are immunosuppressive drugs, which allow
the patient to overcome the normal immune-based rejection of foreign organs
in organ transplantation. Immunophilin is a collective term for peptidyl-proline
cis-trans isomerase proteins which bind with high affinity to immunosuppressant
agents, such as the cyclophilins, which are CsA-binding proteins, and the FKBPs
(FK506 binding proteins). CsA and FK506 inhibit T-cell proliferation at
an early stage by preventing transcription of primarily interleukin-2 (IL-2). The
CsA/cyclophilin and FK506/FKBP complexes bind to calcineurin and inhibit its
activity, thus stopping the immune signal. The immunophilin-calcineurin
interface was mapped by Husi et al. with PAL [136].

Calcineurin is a heterodimer consisting of a large calmodulin-binding catalytic
subunit ‘A’ (61 kDa) and a Ca²⁺-dependent subunit ‘B’ (19 kDa). A photoreactive
CsA containing a diazo group (47, Fig. 16) was prepared. After binding to
cyclophilin and incubation with calcineurin, irradiation resulted in specific
cross-linking mainly to subunit ‘B’, with some minor interaction with subunit ‘A’.



Fig. 16. Photoaffinity probes for studying signaling pathways in immunomodulation

Another class of immunosuppressant agents, e.g., rapamycin, has a different
mechanism. Although rapamycin binds to FKBP12, the complex does not
interact with calcineurin. Instead, the rapamycin-FKBP complex binds to a
specific rapamycin (sirolimus) effector protein (SEP). Since the SEP protein
possesses homology to phosphatidylinositol kinases, it has been suggested that
rapamycin acts through the complex as an inhibitor of the kinase activity,
blocking further, incompletely resolved signaling pathways. In order to map the
ternary complex of rapamycin-FKBP12 and the effector enzyme, photoactivatable
rapamycin analogues were prepared (C-31 and C-42 esters [48, Fig. 16]), which
are currently under investigation. The C-42 modified analogues exhibited T-cell
antiproliferative activity almost equipotent to the parent compound [137].

4.4.4 

Integrin Signaling 

Cell-adhesion molecules [138] are multifunctional molecules that are involved
in a number of cell-regulatory processes, including growth, differentiation,
proliferation, and programmed cell death. Adhesion proteins share a similar
structure and contain a conserved region called the adhesion domain (AD).
Cell-extracellular matrix (ECM) interactions involve cell-surface adhesion
receptors, such as integrin, which also function as signaling molecules. Focal
adhesion biopolymers are rich in tyrosine-phosphorylated proteins that are
associated with receptors and signaling pathways. Ligand binding to integrin
initiates focal adhesion, which transmits the signal by tyrosine
phosphorylation to internal signaling molecules. Integrin is composed of two
subunits, α and β, both possessing a large N-terminal extracellular region and
a short intracellular C-terminal sequence. A large number of combinations are
possible between 17 different α- and 8 β-subunits. According to the current
receptor model, both subunits are necessary for activation, and the presence of
Ca²⁺ and Mg²⁺ is also essential. The metal-binding domains are localized in
the N-terminal part of the β-subunit. The most common feature of integrins is
that small peptides containing a defined sequence (RGD motif) bind to the
receptor with high affinity. One family of these receptors is integrin αVβ3
(vitronectin receptor), which plays an important role in angiogenesis,
apoptosis, and particularly in bone resorption. In order to map the
integrin-ligand interface, conformationally constrained photoaffinity analogues
were designed in which Bpa was placed close to the RGD motif at the C-terminus.
Both radiolabeled and biotinylated (49, Fig. 17) probes were synthesized. A
100-kDa polypeptide was specifically labeled, which corresponded to the β3
subunit. The cross-linking was dependent on the presence of Ca²⁺ and Mg²⁺
ions. Limited digestion resulted in a 20-amino acid labeled fragment, which was
identified as a 19-amino acid portion of the β3 (99-118) subunit and assigned
as the primary contact domain [139].

Fig. 17. A photoaffinity probe for integrin signaling and photoactivatable membrane probes



4.5 

Photoaffinity Labeling of Steroid Hormone Receptors and Membrane Probes 

As discussed above, the majority of cell-surface receptor interactions start
a signal within the cell, which induces several intracellular responses and
ends at the nucleus, initiating transcription/gene expression. Steroids, in
contrast, partition in the cell membrane and readily pass through into the
cytoplasm. They bind directly to nuclear receptors. Nuclear steroid receptors
consist of four major domains: the steroid-binding domain is at the C-terminus;
the hinge domain is involved in nuclear localization; the DNA-binding domain
consists of two “zinc fingers,” which wrap around the DNA helix; and finally
the regulatory domain activates gene-specific transcription factors.

First, several photoaffinity approaches aim to identify ligands within the
membrane phospholipid bilayer. Several groups reported lipid monolayer [140] or
bilayer mimic probes with different photophores. In bilayer-mimicking probes
the photophores (benzophenone, 50 [141]; diazirine, 51 [142]; Fig. 17) are
placed in a bridging position at the center. Using a monolayer photoprobe,
Ourisson et al. [143] reported cross-links to steroids (cholesterol analogues)
partitioned in the deep core of the membrane. This provided evidence that
steroids can be readily transported to the cytoplasm without binding to surface
receptors or using surface proteins for internalization.

The binding of steroid hormones to nuclear receptors is illustrated with some
examples.

Katzenellenbogen studied several steroid receptors [21, 144]. The estrogen
receptor is one of the receptors most frequently investigated by PAL. Hexestrol
is a non-steroidal agonist, which was modified with a diazirine photophore via a
sulfide linkage (52, Fig. 18). The photoprobe efficiently (30%) labeled the
receptor protein (60 kDa) [145]. Alternatively, an anti-estrogen photoprobe was
designed based on the structure of LY117018 (53, Fig. 18) [146].

Fig. 18. Photoaffinity probes for studying steroid receptors

Katzenellenbogen has also studied the progesterone receptor with an aryl azide
derivative of the hormone (54, Fig. 18). The extremely efficient (60%)
photocovalent attachment was identified in two subunits (B: 109 kDa, A: 87 kDa)
in a ratio of 3.3:1 [147].
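
As a small worked illustration of the reported 3.3:1 labeling ratio, the
distribution of covalently attached label between subunits B and A can be
expressed as fractions of the total. This is only an arithmetic sketch: it
assumes the ratio refers to the label incorporated into the two subunits, and
the function name is hypothetical rather than taken from the cited work.

```python
def label_fractions(ratio_b_to_a: float) -> tuple[float, float]:
    """Split the total incorporated label between subunits B and A.

    ratio_b_to_a is the B:A labeling ratio (assumed here to be 3.3 for the
    progesterone receptor example). Returns (fraction_B, fraction_A).
    """
    if ratio_b_to_a <= 0:
        raise ValueError("ratio must be positive")
    total = ratio_b_to_a + 1.0
    return ratio_b_to_a / total, 1.0 / total


frac_b, frac_a = label_fractions(3.3)
print(f"B: {frac_b:.1%}, A: {frac_a:.1%}")  # B: 76.7%, A: 23.3%
```

Under this assumption, roughly three quarters of the attached label resides on
the B subunit.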

In another recent example, Hashimoto reported photoaffinity experiments
on retinoic acid receptors (RAR). Retinoic acid plays a critical role in cell
proliferation and differentiation. RARs belong to the superfamily of
nuclear/thyroid hormone receptors. They consist of six domains (A-F), which is
a general feature of these receptors. The A/B domains have an autonomous
transactivation function, while the C-domain contains the Zn-finger, which
binds to DNA. The large E-domain participates in ligand binding, dimerization,
and ligand-dependent transactivation. Finally, the D- and F-domains assist in
the orientation and stabilization of the E-domain.

A photoreactive aryl azide-linked retinoic acid derivative (termed ADAM-3; 55,
Fig. 18) efficiently labeled the recombinant E-subunit, and the covalent
attachment was identified within residues 492-510 and 589-594, which correspond
to similar sequences found in the human retinoic receptor [148].

5 

Summary and Future Directions 

In the fine regulation of cellular functions, the recognition sites of related
molecules and their binding affinities differ, allowing them to induce
selective responses. On the other hand, many second messengers, adaptor
molecules, and effector enzymes interact with several different protein targets,
which have multiple binding sites on regulatory proteins. As recently depicted,
the “tangled web” of signal transduction provides diverse opportunities
for PAL techniques. Photochemistry provides mild, remotely controllable, and
selective tools for the discovery scientist. PAL techniques help to identify
multiple binding sites for a specific ligand and also provide valuable
information about the location, binding region, and contact architecture of the
interaction. They contribute to uncovering the complete signaling machinery,
which in turn helps in developing specific drugs that restore normal cell
function in cases of aberrant signaling. Photoaffinity labeling efficiently
complements other techniques such as site-directed mutagenesis. In the future,
innovative applications are expected to evolve from the experience gathered so
far in photochemistry and molecular biology. Progress will also be accelerated
by new, sensitive analytical and separation techniques.